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^ (54) Title: TREATMENT OF CANCER 



(57) Abstract: The product of a gene located at human chromosome 21ql 1-21 has ubiquitin specific protease activity and binds 
to ubiquitin, polyubiquitin, ubiquitin-like protein SUMO-3, and proteins which are implicated in the repair/excision of DNA. It is 
believed to have a role in regulating ceU growth, cell growth arrest and/or apoptosis and is implicated in carcinoma growth. The 
gene has been sequenced and cloned into a microorganism and the product's effect on cell growth is to be investigated. The gene 
has GenBank accession number AF1 342 1 3 and has the HUGO-approved name USP25. 
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Treatment of Cancer 

The present invention relates to the use of the product of the USP25 
gene, which appears to have ubiquitin specific protease activity and may 
have protease activity on ubiquitin-like proteins, located at human 
5 chromosome 21 q1 1 in the treatment, prophylaxis or diagnosis of cancer, 
especially solid tumours, more particularly lung cancer. 

Ubiquitin mediated proteolysis by the 26S proteasome is responsible 
for the physiological regulation of the levels of many proteins which are key 
cell cycle and cell growth regulators (Hochstrasser 1995), as well as known 
10 tumour suppressors (Lane 1998). 

ubiquitin is a 78 amino acid polypeptide present in all eukaryotic 
cells, and highly conserved in evolution. Ubiquitin is conjugated to a target 
protein through an isopeptide bond between the e-amino group of Lys in a 
target protein (ubiquitination), a process mediated by three groups of 
15 enzymes: ubiquitin activating enzymes (E1 ), ubiquitin conjugating enzymes 
(E2) and ubiquitin ligases (E3). Ubiquitinated proteins exist in a 
monoubiquitinated form, or a multiubiquitin chain: the former is not a 
degradation signal, while the latter, Lys-48-linked ubiquitin-ubiquitin(n) 
conjugate, works as a strong degradation signal when joined to a Lys in a 
20 target protein. Protein conjugated to polyubiquitin is then very rapidly and 
efficiently degraded by non-lysosomal, ATP-dependent degradation by the 
26-S proteasome. The de-ubiquitinating enzymes (DUB's) broadly fall into 
two classes; ubiquitin specific proteases (USPs) and ubiquitin-C-terminal 
hydrolases (UCHs) are each capable of de-conjugating the ubiquitin- 
25 ubiquitin and ubiquitin-protein links, thereby converting polyubiquitin into 
mono-ubiquitin, and de-coupling (deubiquitinating) ubiquitin from the target 
protein, with the result of preventing the degradation of the target protein. 

The class of UCH enzymes tends to include relatively small proteins 
(about 35-40kD) which have low specif ity for ubiquitinated proteins. BAP1 is 
30 a UCH but is unusual in that it is larger, having a weight of nearly 100kD. 
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Several such enzymes are known, the sequences of which show some 
sequence homologies especially in two domains, the Cys and His domains. 

Very broadly, two main functions have been observed among the 
various members of the USP (UBP) superfamily (Wilkinson 1997, Wilkinson 
5 and Hochstrasser 1998). The first is the generation of free ubiquitin from 
precursor fusion proteins or from peptide-linked polyubiquitin after 
proteolysis of the targeted protein, and the second is de-ubiquitination. 
When a protein is targetted for ubiquitin mediated degradation, it is linked to 
ubiquitin via an isopeptide bond between the C-terminus of ubiquitin and a 

10 lysine E-amino group(s) of the acceptor protein. Once the conjugate is 

formed, it can have only two fates: non-lysosomal proteolysis mediated by 
the 26S proteasome resulting in total protein degradation, or de-conjugation 
from ubiquitin (de-ubiquitination), resulting in the rescue of the target protein 
from degradation (Wilkinson 1997, Wilkinson and Hochstrasser 1998). 

15 The regulation of p53 levels by USP25, and consequential prescribed 

increased cellular predisposition to apoptosis (programmed cell death) upon 
overexpression of USP25, could be a mechanism by which neuronal loss 
occurs in Alzheimer's Disease associated with Down Syndrome. Some 
mechanism could be invoked in explaining the sporadic AD if overexpression 

20 of USP25 occurs through a somatic change in aging neurons. 

The potential USP catalysed de-ubiquitination and rescue from 
degradation of tumour suppressors such as p53, could perhaps explain the 
link to deletions of USPs in solid tumours. De-ubiquitination could play a 
major role in the Mdm2 mediated control of p53 levels and its activation 

25 mechanism, since the ubiquitin-mediated proteasome degradation of p53 is 
an important effector arm of this recently revealed control pathway (Haupt et 
al M 1997, Kubbutat et at., 1997, Lane 1998). Halving the dose of a USP 
could give a pre-cancerous cell a selective advantage in proliferation, by 
diminishing its rate of "re-cycling" of p53, making it more difficult to achieve a 

30 threshold concentration of p53, necessary for its activation. The same 
hypothetical, gene-dose regulated, tumour suppression model could also 
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explain why trisomy 21 (Down's syndrome DS) seems to confer protection 
from development of solid tumours. Among DS children, neuroblastomas and 
nephroblastomas, and among DS adults, gynaecologic, digestive and breast 
cancers have very rarely been reported, and are significantly under- 
5 represented compared to the age-matched euploid population (Oster et al., 
1975, Satgeetal., 1998). 

Baker et al, 1999 also propose a role for USP's in tumour formation 
and cell growth. 

In recent years a number of other protein modifying polypeptide tags 
10 have been identified. Many of these are related to ubiquitin and have high 
levels of identity and similarity (determined using the BLAST algorithm, for 
instance) to ubiquitin itself. There is a recognised super family of such 
proteins which have been termed ubiquitin-like proteins (UbL) (Gong et al. 
1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO-1 (PIC1, 
15 Sentrin, hSmt3C), SUMO-2 (hSmt3A) and SUMO-3 (hSMT3B) belong to the 
same family of UbL proteins with approximately 50% identity between 
themselves, and some 15-30% identity and 40-60% similarity in amino acid 
sequence to ubiquitin (Lapenta et al. 1997, Mannen et al. 1996, Kamitani et 
al. 1998, Saitoh and Hinchey, 2000). Yeast and human UBC9 are capable of 
20 conjugating equally yeast or human UbL-s, but not ubiquitin (Schwarz et al. 
1998). The SUMO-1 ,-2 and -3 have the C-terminai glycine, necessary for 
ubiquitination of the target protein's lysine residue, but unlike ubiquitin, do 
not have the Lys48 residue necessary for the formation of polyubiquitin 
chains through isopeptide bonds, which are the signal for the proteasome 
25 degradation (Saitoh and Hinchey, 2000). Nevertheless, yeast Smt3 protein 
can rescue the mutant Mif2 phenotype, a deficient centromere binding 
protein resulting in chromosome missegregation (Meluh and Koshland 
1995). SUMO-1, as well as SUMO-3 (and probably also SUMO-2) are all 
capable of being attached by UBC9 to RanGapl , a Ran GTP-ase activating 
30 protein (Kamitani et al. 1998). This ATP-dependent attachment is essential 
for the binding between modified RanGapl and RanBP2 binding protein, in 
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order to form functional nuclear pore complex, which controls export and 
import of molecules through the nuclear envelope (Mahajan et al. 1997, 
Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins have 
been shown to modify the death domains of Fas (Okura et al. 1996), Tumour 
5 necrosis factor receptoM (Okura et al. 1996), PML (a tumour suppressor 
implicated in the pathogenesis of acute promyelocytic leukaemia) (Kamitani 
et al. 1998b) and Rad51/52 DNA repair proteins (Shen et al. 1996a). Their 
conjugating enzyme, UBC9, has been shown to interact by Y2H technique 
with RAD51/52 DNA repair proteins, and the master tumour suppressor p53 

10 (Shen et al. 1996b). Another UbL is NEDD8 (Kamitami et al 1997). 

UbL's are conjugated and cleared from their targets by enzymes. 
Several UbL hydrolase enzymes have been identified which convert 
precursor UbL to active UbL. Some such enzymes interact with ubiquitin 
itself as well as with other UbL's. Proteases involved in cleavage of 

15 conjugates of UbL with target protein have been identified for instance 

SENP1 and SUSP-1 , which were recently cloned (Kim et al. 2000, Gong et 
al. 2000a), and found to specifically cleave SUMO-1,-2 and -3, but not 
ubiquitin and NEDD8. The first human enzyme with classical USP structure 
(Cys, His domains) for which dual specificity to both ubiquitin and ubiquitin 

20 like protein was demonstrated was very recently published USP21 on 

chromosome 1q21 (Gong et al. 2000b). However, opposite from SENP1 and 
SUSP-1, this enzyme cleaves ubiquitin and Nedd8, but not SUMO-1,-2 or -3 
(Gong et al. 2000b). 

The proximal third of the chromosome 21 long arm is an exceptionally 

25 gene-poor region of the human genome as estimated by a number of criteria 
(Shimizu et al., 1995, Yaspo et al., 1995, Gardiner 1996) the estimates of 
gene-density range from one gene in a megabase to one gene in six 
megabases of genomic DNA. Until recently only three full length genes had 
been mapped in this region: STCH (a member of the hsp70 family) (Brodsky 

30 et al., 1995), RIP140 (protein functionally interacting with a variety of nuclear 
receptors such as estrogen receptor), (Cavailles et al., 1995) and ANA (a 
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member of the Tob/BTG1 family of tumour suppressors), (Kohno et al., 
1998). This region is also an example of extremely highly methylated regions 
in the human genome. 

Groet et al (1998) describe a high-resolution bacterial contig map of 

5 3.4 Mb of genomic DNA in human chromosome 21q11-q21, encompassing 
the region of elevated disomic homozygosity in Down's syndrome - 
associated abnormal myelopoiesis and leukemia, and which has shown a 
strong association with Alzheimer's disease (AD). It was suggested that the 
high resolution bacterial clone overlap map should be the basis for deriving 

10 a more complete transcriptional map of that region of the chromosome. It 
was hoped that this would lead to an explanation of the chromosome 21 ql 1 
linkage in familial early onset AD (FEOAD) families. In particular it was 
suggested that a modifier gene in that region could act together with the 
presenilin-1 gene to generate or modify the AD phenotype. 

15 Further work by the present inventors has revealed a new gene in the 

proximal third of chromosome 21 and that the product of this gene has 
ubiquitin specific protease properties. It is postulated that the gene product 
and USPs generally may have a role in AD. The work has been published in 
Groet et al (2000) after the first priority date of the present application. 

20 Valero, et al. (1999) published after the first priority date of the 

present application, have, in parallel identified this gene and pointed out the 
gene product's sequence homologies to known USP's in the conserred 
peptide domains previously identified e.g. by d'Andrea et al (1998). They 
postulate a role in Alzheimer's disease. This protein has the HUGO 

25 approved name USP25. 

According to the present invention there is provided the new use of 
the product of the USP25 gene located at human chromosome 21q 1 1-21 , a 
non-human homologue thereof or a functional fragment thereof in the 
manufacture of a medicament for use in the diagnosis, prophylaxis or 

30 treatment of cancer. 
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The cancer is usually a solid cancer, most often non-small cell lung 
carcinoma, or skin cancer. 

A functional fragment herein means a protein having ubiquitin specific 
protease activity or UbL specific protease activity and comprising a portion 
with sequence homology with the product of the said USP25 gene. 

It is believed that the ubiquitin specific protease activity of the protein 
having sequence ID 1 is responsible for its implication in the pathogenesis of 
cancer. USP activity may be determined using the technique described in 
the Examples below, in which using bacteria cotransformed with the USP 
gene and with a reporter gene encoding a fusion protein which is a ubiquitin- 
conjugated detectable protein. The protein may be an enzyme detectable by 
direct enzyme reaction, by enzyme-linked immune assay techniques, by 
autoradio-graphically or by direct staining after gel separation under 
conditions suitable to separate ubiquitin and cleaved protein from fusion 
protein. 

From experiments conducted to determine with which proteins the 
product having sequence ID 1 interacts, we have found that there is 
interaction with ubiquitin, polyubiquitin and various ubiquitin precursors, as 
well as HHR23A (Matsutani et al 1994, GenBank Accession No D21235). 
There is also interaction with other ubiquitin-like proteins and with proteins 
which are known to interact with ubiquitin-like proteins, such as Sumo-3 
(Mannen et al 1996, Kamitani et al 1996, Saitoh et al 2000, GenBank 
Accession No. NM 006937) and ubiquitin-like-specific conjugating enzyme 9 
(Schwarz et al 1998, Lee et al 1998, GenBank Accession No. U 66867). The 
isolated protein may therefore be characterised further by having a positive 
interaction in a yeast-two-hybrid procedure with one or more, preferably all 
three, proteins having the sequences of GenBank Accession Nos. D 21235, 
NM 006937 and U 66867. Ubiquitin-like specific protease activity may be 
determined using techniques analogous to those used to determine ubiquitin 
specific protease activity, by using a substrate which is a fusion protein of 
the ubiquitin-like protein of interest and a detectable protein, and using the 
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usual separation and immune based or autoradiographic identification 
techniques. 

The gene may be transcribed and translated within a cell line selected 
as positive for the native gene or an active (in terms of ubiquitin or UbL 

5 specific protease activity) fragment thereof. The gene has preferably been 
introduced into a microorganism or a cell line in a form in which it can be 
transcribed and translated and the microorganism or the cell-line, as the 
case may be, has been cultured under conditions whereby the gene is 
replicated during cell division, and is transcribed and translated into ubiquitin 

10 specific protease and the USP is recovered. Preferably the gene in the 
microorganism is recombinant DNA derived from the mRNA from celts 
having the active USP gene. Suitably the gene includes sequence ID No. 7, 
more preferably sequence ID No. 6. 

According to a further aspect of the invention there is provided a new 

15 use of a protein product having Cys, QQD and His domains specified in 
sequence IDs numbers 2, 3 and 4, respectively, in the manufacture of a 
medicament for use in the diagnosis, prophylaxis or treatment of cancer. 
The protein has ubiquitin specific protease or ubiquitin-like specific protease 
activity, and has the three specified domains in the USP or UbLSP active 

20 conformation. Preferably the protein has, outside the specified domains, 

some level of sequence homology with sequence ID 1, for instance at least 
20%, preferably at least 50%, identity with that sequence, and a level of 
similarity of at least 50%, preferably at least 70% or more with that sequence 
(in each case determined using, for instance the BLAST algorithm). 

25 Preferably the protein has sequence I.D.1 or sequence l.D.5. Homologues, 
such as the corresponding mouse product, described in Valero, et al 1999 
may be used or sequences which have the above levels of identity and 
similarity with such a sequence. 

According to a preferred embodiment the protein has sequence I.D.1 . 

30 According to another preferred embodiment the protein has sequence 

l.D.5. 
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According to a further aspect of the invention there is provided an in 
vitro method in which mammalian cells are cultured in the presence of the 
product of the USP25 gene located at human chromosome 21 q1 1 or of a 
homologue thereof, or a functional fragment of said product. 
5 According to a further aspect of the invention there is provided an in 

vitro method in which mammalian cells are cultured in the presence of a 
protein product having Cys, QQD and His domains specified in sequence 
IDs numbers 2, 3 and 4, respectively. Preferably the protein has, outside the 
specified domains, some level of sequence homology with sequence ID 1, 

10 for instance at least 20%, preferably at least 50%, identity with that 

sequence, and a level of similarity of at least 50%, preferably at least 70% or 
more with that sequence (in each case determined using, for instance the 
BLAST algorithm). Preferably the protein has sequence LD.1 or sequence 
I.D.5. Homologues, such as the corresponding mouse product, described in 

15 Valero, et al 1999 may be used or sequences which have the above levels of 
identity and similarity with such a sequence. 

In these in vitro methods the effect of the protein on cell growth, cell 
growth arrest and/or apoptosis is assessed. 

The microorganism containing the specified gene has preferably been 

20 transformed with a vector comprising at least a portion of sequence I.D.6 
comprising residues 199 to 3367, optionally including an additional 96 b.p. 
exon inserted after base 2356, and optionally including 5' and/or 3' 
untranslated regions (UTR's). The vector preferably comprises 
transcriptional and translational control sequences. The microorganism may 

25 be a yeast but is most conveniently a bacterium. 

A mammalian cell transfected with a vector preferably comprises 
exons making up a sequence comprising residues 1 99 to 3367 of sequence 
I.D.6 and optionally also regulatory genomic sequences which are present in 
wild-type chromosome which lead to enhanced expression activity. 
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The sequence from residues 199 to 3367 of sequence ID no 6 is 
specified as sequence ID7 and consists of start codon (ATG), an open 
reading frame of 3165 nucleotides, and a stop codon (TAA). 

According to a further aspect of the invention there is provided a new 
5 use of DNA including the gene located at human chromosome 21 q 11 -21 or 
a fragment thereof encoding a functional USP product in the manufacture of 
a medicament for use in the prophylaxis or treatment of cancer. The DNA is 
preferably incorporated in a gene therapy vehicle, and is preferably part of a 
vector, for instance a plasmid vector, or viral vector, capable of transfecting 
10 cells in vivo. The medicament usually includes a pharmaceutical^ 
acceptable carrier. 

The invention is illustrated further in the accompanying drawings in 

which: 



products from the experiments reported below 

Figure 3 represents sequence homologies of USPs 
and 

Figure 4 represents the results of the experiment illustrating USP 
20 properties reported below. 

The following specific description describes the work which has been 
carried out. 



25 deleted in non-small cell non carcinoma (NSCLC) for further study. The 

region contained the DNA marker with the highest NSCLC-associated loss of 
homozygosity (LOH), reported by Kohno et al. We found a shared region of 
overlap (SRO) for the hemizygous loss in other NSCLC. The current work is 
to identify genes in the SRO which have a potential role in tumour 

30 suppression. 



15 



Figure 1 represents a map of human chromosome 21 

Figure 2 represents a map showing the location of exon trapped 



Experimental 

We identify a portion of human chromosome 21 homozygously 
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A total of 42 fresh NSCLC cases have been analyzed from the 
Croatian Tumour Bank (CTB), an initiative with set rules and criteria for 
accumulation of fresh clinical tumor specimens for molecular studies 
(Spaventi ef a/., 1994). Of these, half (including tumors #47 and #61 ) were 

5 samples that were recently studied for LOH of the NM23-H1 gene (Bosnar et 
at., 1997), and the other half were fresh tumors (data obtainable from CTB, 
which also lists the tumor stage in the TNM system, grade, size and survival 
data). In each case, the tumors and normal lung tissue specimens (as 
evaluated by the surgeon) were frozen in liquid nitrogen in the operating 

10 room and further stored at -70°C Genomic DNA was isolated using standard 
procedures (Sambrook et ai, 1989). For each sample, 4pm serial frozen 
sections were cut, mounted on glass slides, and stained with hematoxylin- 
eosin (H&E). A pathologist confirmed the histologic type of the tumor and 
evaluated the percentage of normal cells within the tumor. Only samples 

15 with less than 20% non-tumor cells were used in this study. 
Markers and LOH Analysis 

Microsatellite analysis was performed using polymerase chain 
reaction (PCR) with appropriate primer pairs (sequences and PCR 
conditions as in Genome Data Base, Johns Hopkins University, Baltimore, 

20 MD), where the forward primer only from each pair was 5' fluorescently 

labeled with Applied Biosystems (ABI; Foster City, CA) Big Dyes™ (6-FAM 
or HEX). Amplification products were analysed using an ABI 310 Genetic 
Analyzer. Size standards (GeneScan 350) were mixed with every sample for 
accurate sizing; the separation of the mixture of denatured fragments was 

25 achieved by electrophoresis through a 47 cm capillary (module GS STR 
POP4 C) for approximately 30 min. Raw data were analyzed using 
GeneScan and Genotyper software. LOH ratios were calculated exactly as 
described in the GeneScan Applications manual provided by ABI. For each 
individual allele's fluorescence level, an average of 3 independent 

30 electrophoresis-analysis cycles on the ABI 31 0 was used for calculation. 



WO 00/79267 POT/G BOO/02446 

11 

Fluorescence In Situ Hybridization (FISH) 

Unstained 4pm-thick paraffin block sections were fixed to glass slides, 
and a standard pretreatment protocol was followed for formalin-fixed, 
paraffin-embedded slides. P1 -derived artificial chromosome (PAC)DNA was 
5 labelled with digoxygenin-1 1 -dUTP (Boehringer Mannhein, Germany). 
Approximately 0.5 pg of each labelled PAC DNA sample was mixed with 5 
[ig of Cot1 DNA (Gibco BRL, Gaithersburg, MD), precipitated, denatured, 
allowed to preanneal, and then applied to a denatured slide and hybridized 
overnight. Slides were washed and signal detected using anti-digoxygenin- 

10 rhodamine, followed by DAPI counterstain. Images were captured using a 
Zeiss Axioskop microscope equipped with a charge-coupled device (CCD) 
Photometries, Tucson, AZ) connected to an Apple Powermac 8100 
computer. Images were captured on 3 levels of focus, and each level was 
examined for signals using SmartCapture software (Vysis, Inc., Chicago, IL). 

15 Only nuclei with signals were counted in each level, and the number of 
signals in each cell was determined. B: FISH using a pool of PACs 90B5, 
126N20 and BAC 391 12 as a probe on the paraffin embedded sections of 
the tumour #61. Two signal nuclei are predominant. C: FISH using a pool 
of PACs 73M5 and 135E14 as a probe on the paraffin embedded sections of 

20 the tumour #61. Single signal nuclei are predominant. 
Northern Blot Analysis 

The cDNAs were labelled by random priming and hybridized to human 
multiple tissue Northern blots (Clontech, Palo Alto, CA) containing 2 |jg 
polyA + RNA per lane using the protocol recommended by the manufacturer. 
25 The exposure was for 14 hr to Molecular Dynamics (Sunnyvale, CA) 

Phosphorimager screens. The I.M.A.G.E. Consortium (Lennon et a/. t 1996) 
cDNA clone ID 824710 and the Unigene clone A002B43 have been used as 
labelled probes in separate experiments. 
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Cleavage Analysis of 

Ubiquitin-Met-p-Galactostdase Fusion Protein 

This analysis was performed essentially as described (Everett et a/., 
1997). Model fusion protein ubiquitin-Met-0-galactosidase in a pACYC184 
5 (Cm r replicon) was represented by the plasmid pACYC-Ub-Met-p-gal, a kind 
gift of R. Everett. Plasmid pRB105 containing a Saccharomyces cerevisiae 
ubiquitin-specific protease UBP2 in an IPTG-inducible pBR322 (Amp r 
replicon) was a kind gift of R. Baker, and was used as a positive control. 
The new gene USP25 was cloned from nucleotide position 203 to nucleotide 

10 position 3367 (numbering as in GenBank AF 134213) into SacUSaH cloning 
sites of the IPTG-inducible Escherichia coli expression vector pQE30 
(Qiagen, Chatsworth, CA). The E. coli XL-1 blue cells were transformed 
using a standard rubidium chloride-heat shock method with the combination 
of pACYC-Ub-Met-P-gal and either pQE30 vector, pQE30-L/SP25, or 

15 pRB105, and each of the 3 cotransormants was selected on medium 

containing chloramphenicol (42 |jg/ml) and carbenicillin (75 pg/ml). Western 
blots were prepared by electrotransfer to a nitrocellulose membrane 
(Schleicher & Schull, Keene, NH). The p-galactosidase-containing bands 
were detected by an anti-p-galactosidase polyclonal rabit antiserum (a kind 

20 gift of R. Everett) using an enhanced chemiluminescence (ECL) assay kit 
(rpn 2132; Amersham, Arlington Heights, IL) under conditions recommended 
by the manufacturer. 

Identification and cloning of USP25 

25 Twelve sequenced exon-trapped products, when analysed using 

BLAST-N against public sequence databases, revealed clusters of 
overlapping cDNA clones. Sequences of our exon-trapped products matched 
exactly the sequences of the cDNAs forming contigs with a large open 
reading frame (ORF). In three cases (see Fig.2): from EST 824710 to 

30 AA209364, from AA307805 to AA081 200 and from N92952r to Z4501 0, our 
trapped exon sequences served to bridge the gaps in the gene sequence 




WO 00/79267 



PCT/GB00/02446 



13 



using PCR and suitable restriction, ligation and chain extension techniques. 
The combined sequence (sequence ID no.5 and GenBank accession 
number AF134213) revealed a 199 bp 5'UTR, start codon, an ORF of 3165 
nucleotides encoding a protein of 1055 amino acids, a stop codon, a 3'UTR 

5 of 435 nucleotides and a polyadenylation signal. The total length (without the 
polyA) assembled is 3803 nucleotides. On multiple human tissue Northern 
blots (Fig.3) a band of 4.1 kb is visible in all 16 tissues tested (including the 
normal human lung tissue) with a varying intensity. It is most prominent in 
skeletal muscle and testis, and the latter tissue also reveals a prominent 

10 shorter hybridising transcript of 1 .4 kb. All tissues also show a larger weaker 
band of 4.9 kb, which could be due to an alternative polyadenylation site. 

In the course of this analysis, the whole genomic sequence of the two 
PACs (73M5 and 135E14) became publicly available by the German Human 
Genome Sequencing Consortium (EMBL accession numbers AJ010597 and 

1 5 AJ01 0598). Comparison of the genomic sequence with the overlapping 

cDNA clones and exon sequences revealed that 12 out of the 24 exons had 
been exon-trapped (hatched rectangles in Fig.2). It also became apparent 
that the region immediately preceeding the first exon of the gene, comprises 
the known chromosome 21 CpG island at D21S382 (also known as LL56 Not 

20 I linking clone on the Not I physical map of 21 q, Ichikawa et al., 1993). 

When the deduced polypeptide sequence was compared to Swissprot 
and other public databases using BLAST-P, a clear pattern of significant 
homologies (e=10* 6 to 10' 29 )to proteins across the evolutionary spectrum of 
eukaryotes was found (Fig.4): all of these proteins belonged to the 

25 superfamily of ubiquitin specific proteases (USP-s) or ubiquitin carboxy- 
terminal hydrolases (UCH-s) (Baker et al, 1992, Swanson et al., 1996, 
Everett et al., 1997, Wilkinson 1997, Hansen-Hagge et al., 1998, Jensen et 
al., 1998, Fujiwara et al., 1998, Wilkinson and Hochstrasser 1998, The C. 
elegans Sequencing Consortium 1 998). The polypeptide sequences were 

30 most highly conserved around the three domains (the Cys box, the QQD box 
and the His box, Fig.4) known to be essential for the main function of these 
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enzymes: the cleavage of ubiquitin at its carboxy terminus from extension 
proteins (ubiquitin precursors) and ubiquitinated proteins and protein 
fragments targetted for the degradation by the 26S proteasome pathway 
(Wilkinson and Hochstrasser 1998). The Cysteine residue at position 178 
5 and the Histidine residues at positions 599 and 607 (marked with an asterix 
in Fig.4), which were shown to be an absolute requirement for the function of 
USP-s and UCH-s (Amerik et al., 1997, Hansen-Hagge et al., 1998, 
Wilkinson and Hochstrasser 1998) were found in the correct positions in the 
sequence of the new gene. Since this is a first member of the USP family 
10 known to map onto human chromosome 21 , we named this protein USP25, 
for Ubiquitin Specific Protease on Chromosome 21. 



15 substrate was investigated by co-expression in E Coli. The complete coding 
sequence of USP25 was cloned into a T5-driven, IPTG inducible expression 
vector (pQE30). The new gene USP 21 was cloned from nucleotide position 
203 to nucleotide position 3367 (numbering as in sequence ID no. 2 into Sac 
/Sal cloning sites of the IPTG-inducible E.coli expression vector. As a 

20 positive control, the plasmid pRB105 containing a UBP2 gene encoding a 
S.Cerevisiae ubiquitin specific protease in an IPTG inducible and Amp R 
vector was used. The XL-1 blue strain of E. Coli was co-transformed with 
the plasmid containing a ubiquitin-Met-p-galactosidase model fusion protein 
in an IPTG-inducible and chloramphenicol resistant vector, in addition to 

25 either pQE30 vector, pQE30-USP25 or the positive control (pRB105). (each 
of the 3 co-transformants was selected on medium containing 
chloramphenicol (42 ygfm\) and carbenicillin (75 pg/ml). Co-transformants 
were grown to exponential phase, IPTG induced, and the crude protein 
extracts from these cultures were analysed by Western blot using an anti p- 

30 galactosidase antibody. (The western blots were prepared by electro 
transfer to a nitro cellulose membrane (Schleicher and Schuel.)). The p- 



The novel protein (USP25) cleaves ubiquitin from carboxy- 
terminal fusion proteins 

The ability of USP25 to cleave a model ubiquitin fusion protein 
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galactocidase containing bands were detected by an anti-B galactosidase 
polyclonal rabbit anti serum using enhanced-chemiluminescence assay kit 
(ECL, Amersham rpn2132) under conditions recommended by the 
manufacturer. 

5 As can be seen in Fig.4, the uncleaved Ub-Met-B-gal substrate (band 

labelled with an asterix in Fig.4, lane 4) converts to an 8 kDa shorter band 
(triangle in Fig.4) in the cells co-transformed with either USP25(lanes 5,6) or 
the yeast UBP2 expressing plasmid (lanes 9,10). Constitutive expression of 
USP25 (lane 5) is quite sufficient to cleave to completeness the low levels of 

10 model substrate. The more prominent and highly induced band migrating 
slightly further in the gel than the de-ubiquitinated cleavage product is the 
truncated form of b-galactosidase expressed by the XL-1 blue bacteria 
(compare to lanes 1 ,2 in Fig.4). This result demonstrates that the novel gene 
product named USP25 can efficiently function as a de-ubiquitinating 

1 5 enzyme. 

From the homologies in the functional domains and from its ability to 
hydrolyse the bond between the C-terminal double glycine of ubiquitin and 
the linking methionine residue (Fig.5), it can be concluded that USP25 is a 
member of ubiquitin specific proteases. 

20 Determination of Proteins with which USP25 interacts 

Functional analysis of USP25 was performed with the aim of detecting 
the cellular proteins which interact with the USP25 protein through protein- 
protein interaction, using Yeast-Two-Hybrid (Y2H) approach. 
Saccharomyces Cerevisiae yeast has well characterised ubiquitin activating, 

25 conjugating and ligating ezymatic machinery, capable of ubiquitinating 
human proteins (Scheffner et al. 1998). A cDNA library from human brain 
cloned in "prey" vector, was co-transfected to yeast cells with USP25 cloned 
in "bait" vector. Since ubiquitin cleaving activity of USP25 was proven (Groet 
et al. 2000), this technique has a theoretical chance of detecting the natural 

30 cellular substrates for ubiquitin cleavage and de-ubiquitination by USP25. 
Since the action of ubiquitin cleavage is very rapid (Wilkinson and 
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Hochstrasser 1998), the cleavage and dissociation from its natural 
substrates for fully active USP25 could preclude the ability to detect the 
interaction through Y2H. In addition, the artificial cross-de-ubiquitination of 
yeast's own proteins by an overexpressed USP25 could theoretically be 

5 harmful for the yeast cell and/or for the molecular interactions required for 
the Y2H. For these reasons we performed Y2H using USP25-C178A, a site 
directed mutant we recently engineered (the mutation being of the key Cys 
residue in the Cys region) which abolishes the capacity for cleavage of 
ubiquitin by USP25, but should not interfere with the binding of USP25 to its 

10 natural ubiquitinated substrates, since this residue is conserved between all 
UCH-s and USP-s so far identified. 

Y2H experiment using USP25-C178A cloned in the yeast two hybrid 
"bait" vector pAS2 (Clontech) co-transformed into yeast cells together with a 
human adult brain cDNA library in the "prey" vector pACT2 (Clontech). 

15 Interacting events were visualized by the activation of transcription of all 
three reporter genes: Ade2, Mel1 and His3. Interacting "prey" sequences 
were verified by PCR-sequencing on the ABI310 automated sequencer, 
using universal vector primers, and analysed by BLAST search on non- 
redundant genome and transcriptome sequence databases. The accession 

20 numbers of the sequences found to be interacting, from the GenBank 
database are given in the table. 



WO 00/79267 PCT/G BOO/02446 

17 



Table 1. Summary of frequency and identities of specific interacting 
proteins from human brain with USP25-C178A, detected using Yeast- 
Two-Hybrid technique 



Summary of Results by 
decreasing frequency of 
detection of baits 


Number of specific 
independent clones 
"fished" by Y2H 


Accession number 


HHR23A 


8 clones 


D21235 


SUMO-3 


8 clones 


NM 006937 


human UBC9 


5 clones 


U66867 


polyUbiquitin 


4 clones 


AB009010 


Ubiquitin 


3 clones 


X04803 


Ran BP2 protein 


1 clone 


NM 006267 


Various ubiquitin-like 
precursors (1 or 2 clones 
each): 


4 clones 




Other proteins (1 or 2 
clones each) 


10 clones 





Conclusions - Interaction with HHR23A 

20 DNA repair plays a key role in prevention of carcinogenesis and 

mutagenesis. This is potentially of special significance to solid tumours 
which have exposure to UV and chemical carcinogens as the major risk 
factor, such as cancers of the skin and lung. HHR23A is a homologue of 
yeast RAD23 protein (Masutani et al. 1994), involved in DNA excision-repair 

25 after UV damage and implicated in spindle pole body duplication and cell 
cycle progression in yeast (Watkins et al. 1993, Biggins et al. 1996). Human 
homologues HHR23A and B (Masutani et al. 1994) both belong to a group of 
proteins which, when mutated, lead to Xeroderma Pigmentosum, a rare 
autosomal recessive disorder associated with a high incidence of sunlight 

30 (UV) induced skin cancers. 

More importantly, hRAD23A has also been isolated as a primary 
interacting protein by the same Y2H technique using E6AP as a "bait" 
(Kumar et al. 1999). E6AP (Human Papilloma Virus E6 associated protein) 
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functions as one of the two so far detected ubiquitin ligases (attaching 
ubiquitin and labelling for degradation) for the master tumour suppressor p53 
(Scheffner et al. 1993). The p53 and HHR23A are the only two so far proven 
targets for this ubiquitin ligase (Kumar et al. 1999).. Since USP25 shows 

5 high rate of target preference for HHR23A (see Table 1 ), and both HHR23A 
and p53 are ubiquitinated by E6AP, it could mean that they are both de- 
ubiquitinated by USP25. The fact that Y2H with USP25 did not pick up p53 is 
understandable, because p53 is expressed in small traces (very low level) in 
normal tissues, and gets only accumulated and activated following DNA 

10 damage or other stimuli for programmed cell death (apoptosis) (Haupt et al. 
1997, Kubbutat et al. 1997, Lane 1998). Further experiments are therefore 
justified to provoke the p53 response, and monitor the effects of USP25 on 
p53 levels. 

Moreover, lack of functional E6AP accelerates the polyglutamine- 

15 induced neuronal cell death in the mouse model for the neurodegenerative 
disease Spinocerebellar-ataxia 1 (SCA1) (Cummings et al. 1999). Lack of 
E6AP gene in a mouse expressing the polyglutamine stretch mutation of 
SCA1 protein dramatically reduces the presence of ubiquitinated 
intranuclear neuronal inclusions, but drastically accelerates the neuronal 

20 degeneration and cell death (Cummings et al. 1999). A very similar effect 
has been observed in Huntington's Disease (HD), where a dominant 
negative mutant of a ubiquitin conjugating enzyme (UBC3), when co- 
expressed in cultured neurons with the huntingtin protein bearing the 
polyglutamine extension (mutation causing HD), drastically reduces the 

25 presence of ubiquitinated intranuclear neuronal inclusions, but drastically 
accelerates the neuronal degeneration and cell death (Saudou et al. 1998). 
If USP25 de-ubiquitinates a similar set of target proteins to the ones 
ubiquitinated by E6AP, then the overexpression of USP25 may lead to 
similar effects as the inhibition of ubiquitin conjugation by E6AP. It is 

30 therefore justified to examine the effects of overexpression of USP25 (and 
other USP-s) on neuronal toxicity. 
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Interaction with human UBC9 and ubiquitin like proteins (UbL) 

In yeast, there are 13 ubiquitin conjugating enzymes (E2 enzymes) 
(Scheffner et al. 1998). Only two (UBC3, mentioned previously in context 
with HD, and UBC9) are essential for cell cycle progression. Without UBC3 

5 the cell cycle is arrested at the transition point from G1 to S phase, whereas 
without UBC9 the cell cycle is arrested at the transition point from G2 to M 
phase (Scheffner et al. 1998). Yeast UBC9, and its mammalian homologue, 
e.g. human UBC9 have a special function among all other UBC(E2) 
enzymes, in that they are specifically not conjugating to target proteins the 

10 molecule of ubiquitin, but rather of Ubiquitin-Like small proteins (UbL) 

(Gong et al. 1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO- 
1 (PIC1, Sentrin, hSmt3C), SUMO-2 (hSmt3A) and SUMO-3 (hSMT3B) 
belong to the same family of UbL proteins with approximately 50% identity 
between themselves, and some 15-30% identity and 40-60% similarity in 

15 amino acid sequence to ubiquitin (Lapenta et al. 1997, Mannen et al. 1996, 
Kamitani et al. 1998, Saitoh and Hinchey, 2000). Yeast and human UBC9 
are capable of conjugating equally yeast or human UbL-s, but not ubiquitin 
(Schwarz et al. 1998). The SUMO-1 ,-2 and -3 have the C-terminal glycine, 
necessary for ubiquitination of the target protein's lysine residue, but unlike 

20 ubiquitin, do not have the Lys48 residue necessary for the formation of 

polyubiquitin chains through isopeptide bonds, which are the signal for the 
proteasome degradation (Saitoh and Hinchey, 2000). Nevertheless, yeast 
Smt3 protein can rescue the mutant Mif2 phenotype, a deficient centromere 
binding protein resulting in chromosome missegregation (Meluh and 

25 Koshland 1995). SUMO-1 , as well as SUMO-3 (and probably also SUMO-2) 
are all capable of being attached by UBC9 to RanGapl, a Ran GTP-ase 
activating protein (Kamitani et al. 1998). This ATP-dependent attachment is 
essential for the binding between modified RanGapl and RanBP2 binding 
protein, in order to form functional nuclear pore complex, which controls 

30 export and import of molecules through the nuclear envelope (Mahajan et al. 
1997, Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins 
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have been shown to modify the death domains of Fas (Okura et al. 1 996), 
Tumour necrosis factor receptorl (Okura et al. 1996), PML (a tumour 
suppressor implicated in the pathogenesis of acute promyelocytic 
leukaemia) (Kamitani et al. 1998b) and Rad51/52 DNA repair proteins (Shen 
5 et al. 1996a). Their conjugating enzyme, UBC9, has been shown to interact 
by Y2H technique with RAD51/52 DNA repair proteins, and the master 
tumour suppressor p53 (Shen et al. 1996b). 

The USP25 Y2H data show clear pattern of interaction in the UBC9 
pathway. Interaction with UBC9 itself, is to our knowledge the first of the kind 
10 demonstration of a direct protein-protein interaction between a USP and a 
conjugating enzyme. Interaction with RanBP2 and SUMO-3 clearly shows 
that USP25 could be sharing the similar target repertoire as UBC9 (Saitoh et 
al. 1997). USP25 may be removing the SUMO (UbL) molecules attached to 
targets by UBC9. Alternatively, USP25 may be preparing the UbL-s for 
15 attachment by UBC9 to targets, by removing the oligopeptide extensions 
after the C-terminal Gly-Gly group from the UbL-s. The only ubiquitin 
protease found to be essential for yeast cell cycle progression, Ulp1 , was 
found to be specific to Smt3 removal, and not to ubiquitin (Li and 
Hochstrasser 1999). This protease also had a completely different sequence 
20 from known USP-s and UCH-s. Its human homologues, SENP1 and SUSP-1 , 
were recently cloned (Kim et al. 2000, Gong et al. 2000a), and found to 
specifically cleave SUMO-1 ,-2 and -3, but not ubiquitin and NEDD8, another 
UbL (Kamitani et al. 1997). The first human enzyme with classical USP 
structure (Cys, His domains) for which dual specificity to both ubiquitin and 
25 ubiquitin like protein was demonstrated was very recently published USP25 
on chromosome 1q21 (Gong et al. 2000b). However, opposite from SENP1 
and SUSP-1, this enzyme cleaves ubiquitin and Nedd8, but not SUMO-1, -2 
or -3 (Gong et al. 2000b). 

If USP25 cleaves the same UbL proteins to which it binds by Y2H 
30 (see Table 1 ), it would appear that it is the first classical USP capable of 
cleaving both ubiquitin and human SUMO family, which maybe very 
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significant to its function. Moreover, it is possible that USP25 is actually 
specific for SUMO-3 rather than SUMO-1 or SUMO-2. The distinction 
between these targets would be the first of the kind, but remains to be further 
confirmed. The previously observed interaction with hRAD23A (Table 1 ), as 
5 well as with other various precursor proteins containing the ubiquitin-like- 
domains (Table 1 ), could be linked also to its affinity to a certain type of 
ubiquitin like domain. 

Finally, and very importantly, SUMO molecules have been shown to 
attach post-translationally to p53 tumour suppressor itself, by the 

10 conjugating enzyme UBC9 (Rodrigues et al. 1999, Gostissa et al. 1999). 
This modification for SUMO-1 in vitro requires only SUMO-1, the SUMO-1 
activating enzyme and UBC9. The authors of Rodrigues et al. therefore 
conclude that SUMO-1 modification pathway acts as a potential regulator of 
the p53 response and may represent a novel target for the development of 

15 therapeutically useful modulators of the p53 response (Rodrigues et al. 
1999). 

We would propose that the cleavage of ubiquitin like proteins, 
and its association with UbC9 could be an alternative pathway explaining 
USP25's role in cell cycle control, and through it, its role in control of 
20 programmed cell death, with direct implications in both tumour suppression 
and neurodegeneration. 



Figure Legends 

Figure 1. Identification of the Shared Region of Overlap 
25 (S.R.O.) for hemizygous deletions in 2lq11-q21 in NSCLC.A: Cytogenetic 
map, Not I long range physical map (Ishikawa, et a/., 1993), YAC contig 
(Nizetic, ef al 1994, Shinizu et al 1995 and Bosch et al 1996), and bacterial 
contig, (Groet et al 1998) are shown in consecutive horizontal layers, 
respectively, above the line showing the markers used in the LOH analysis 
30 (oval symbols). Markers are named as in Genome DataBase (prefix U D21 M 
omitted). In the column under each marker an "X" (symbolizing LOH), "+ M 
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(an absence of LOH) and "IT (un-informative, homozygous result) for that 
marker in the set of eleven Croatian Tumour Bank (CTB) tumours, or in 
individual tumours #47 and #61 are shown. NT=not tested. For comparison 
with our data, markers used as probes on the genomic Southern blot of the 

5 NSCLC cell line, and/or in LOH analysis of fresh tumours in the study by 

Kohno and co-workers (Kohno et al 1998) are indicated above the empty bar 
symbolizing the homozygous deletion they found. In our data, hatched bars 
indicate hemizygous deletions, and black filled bars indicate segments 
showing absence of LOH or deletions. Squared symbols "X" and 11 +" stand 

10 for predominantly single and predominantly double signal, respectively, 

detected by FISH on interphase nuclei of the paraffin embedded sections of 
the tumour #61 , when PAC clones named and indicated as bold lines in the 
PAC contig above the markers line, were used as probes. 



15 deduced from overlapping sequence analysis (white rectangles) defining the 
exon-intron structure of the new gene USP25. Top half shows two PACs 
73M5 and 135E14, also used as FISH probes in Fig. 1 , which were the 
source of genomic DNA for exon trapping. Exon locations on the PACs are 
shown with vertical bars, and the 50 kbp scale bar refers to this part. Bottom 

20 half consists of overlapping cDNA fragments corresponding to exons above 
them, drawn in the same scale, (500 bp scale bar is shown). Names of 
cDNA clones are as in dB-EST and UniGene databases, 824710 is the 
address of the clone in the IMAGE Consortium collection. The complete 
cDNA sequence for the whole gene is the new GenBank entry with the 

25 accession number AF 1 3421 3. 



other eukaryotic members of the superfamily of USP-s. The protein BAP-1 is 
actually from the family of Ubiquitin C-terminal Hydrolases, a distinct sub- 
family of this superfamily, showing homology only in the single key 
30 aminoacids in the Cys and His domains. Two reports show the localisations 



Figure 2. Trapped exons (hatched rectangles) and exons 



Figure 3. Comparison of protein sequences of USP25 to 
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of the highly homologous sequences for the HAUSP gene to 3p21 (Kashuba, 
et al 1997) and 16p13 (Robinson, et at 1998), respectively. 



USP25 on a model ubiquitin fusion protein. Western blot of an SDS-PAGE 
5 was detected using an anti-p-galactosidase antiserum. Lanes 1,2: the E. coli 
XL-1 blue cells alone (in all cases second line of the pair is +1PTG). Lanes 
3,4: same cells co-transfected with the model fusion protein encoding 
plasmid pACYC-UB-Met-0-galactosidase protein, band labelled with an 
asterix). Lanes 5,6: as lanes 3,4 except pQE30-USP25 (full length USP25 
10 gene cloned in the pQE30 expression vector) was added instead of pQE30. 
Lanes 7,8: same as lanes 3,4 except pRB105 (yeast de-ubiquitinating 
enzyme UBP2) was transfected instead of pQE30. Lanes 9,10: over- 
exposure of lanes 7,8. Note the presence of the 8kDa shorter, de- 
ubiquitinated Met-p-galactosidase (band labelled with a triangle). 

15 
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CLAIMS 

1 . Use of the product of the USP25 gene located at human 
chromosome 21 q 1 1-21 or a non-human homolpgue thereof or a functional 
fragment of the product manufacture of a medicament for use in the 

5 diagnosis, prophylaxis or treatment of cancer. 

2. Use according to claim 1 in which the cancer is a solid 
tumour, for instance non-small cell lung carcinoma or skin cancer. 

3. Use according to claim 1 or claim 2 in which the product is 
obtained from a microoganism or cell line in which the gene as recombinant 

10 DNA has been introduced in a form in which it is replicated upon cell 
division, and is transcribed and translated. 

4. Use according to claim 3 in which the recombinant DNA is 
incorporated in a vector which comprises at least a portion of sequence I.D.6 
comprising residues 199 to 3367, optionally including an additional 96 b.p. 

15 exon inserted after base 2356, and optionally including 5* and/or 3' 
untranslated regions (UTR's). 

5. Use of a protein product having Cys, QQD and His domains 
specified in sequence IDs numbers 2, 3 and 4, respectively, in the 
manufacture of a medicament for use in the diagnosis, prophylaxis or 

20 treatment of cancer. 

6. Use according to claim 5 in which the protein has sequence 
LD. No. 1. 

7. Use according to claim 5 in which the protein has sequence 
LD. No. 5. 

25 8. An in vitro method in which mammalian cells are cultured in 

the presence of a protein product having Cys, QQD and His domains 
specified in sequence IDs numbers 2, 3 and 4, respectively, and the effect of 
protein on cell growth, cell growth arrest and/or apoptosis is assessed. 
9. A method according to claim 8 in which the protein has 

30 sequence I.D.1 or sequence I.D.5. 
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1 0. Use of DNA including the gene located at human 
chromosome 21 q 11-21 (USP25) or a fragment thereof encoding a functional 
USP or UbLSP product in the manufacture of a medicament for use jn the 
prophylaxis or treatment of cancer. 
5 11. Pharmaceutical composition comprising a gene therapy 

vehicle and DNA including the gene located at human chromosome 21 q 1 1- 
21 (USP25) or a fragment thereof encoding a functional USP or UbLSP 
product in transcribable form. 
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SEQ ID 1 protein: 

10 M7VEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELAV 
AFLTAKNAKTPQQEETTYYQTALPGNDRYISVQSQADTNVIDLTGDDKDDL 
QRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVWRD 
SRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKPPS 
NAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDAFK 

15 SNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFYGRF 
LAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSEN 
SGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLYLDR 
YMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYA 
LEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPS 

20 SVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIE 
NDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAY 
IFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYINDKAQFL 
IQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQ 
EKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASH 

25 EHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHWVYFIQNQAPK 
KIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQD 
YRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELI 
SHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEK 
DILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKL 

30 PSYSTHELCERFARIMLSLSRTPADGR 
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SEQ ID no 2: (polypeptide) 
GLKNVGNTCWFSAVIQSL 

SEQ ID no 3: (polypeptide) 
5 QQDVSEFTHKLLDWLED 

SEQ ID no 4: (polypeptide) 
YRLHAVLVHEGQANAGHYWAY 

10 SEQ ID no 5: (polypeptide) 

MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELA 
VAFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKD 
D LQ RAI ALS LAESNRAFRETG ITD EEQ AIS RVLEAS I AEN KAC LKRTPTEVW 
RDSRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKP 

1 5 PSN AQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDA 
FKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 
GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLH 
SENSGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLY 
LDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDV 

20 LQYALEFASSKPVCTSPVDDIDASSPPSGS1PSQTLPSTTEQQGALSSELP 
STSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEKLSVLESCLHR 
WRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANA 
GHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLM 
YINDKAQFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEW 

25 DAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKH 
LKEETIQIITKASHEHEDKSPETVLQSIMMTPNMQGIIMAIGKSRSVYDRCG 
PEAGFFKAIKLEYARLVKLAQEDTPPETDYRLHHVWYFIQNQAPKKIIEKT 
LLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRK 
FRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELIS 

30 HYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEE 
KDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEP 
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PKLPSYSTHELCERFARIMLSLSRTPADGR 



10 > 
> 
> 
> 
> 

15 > 
> 
> 
> 
> 

20 > 
> 
> 
> 
> 

25 > 
> 
> 
> 
> 

30 > 
> 
> 
> 
> 

35 > 
> 
> 
> 



:Q ID no 6: 
DNA 

ASE COUNT 1223 a 752 c 841 g 987 1 
ORIGIN 

1 acagtcggcg tttcgccgcc tgcccgcggt gcccgcgcac gccggccgcc atcgccttcg 
61 cgcctggctg gcgggggcgc tgtcctccca ggccgtccgc gccgctccct ggagctcggc 
121 ggagcgcggc agccagggcc ggcggaggcg cgaggagccg ggcgccaccg ccgccgccgc 
181 cgccgccgcc gcgggggcca tgaccgtgga gcagaacgtg ctgcagcaga gcgcggcgca 
241 gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca 
301 gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 
361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 
421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 
481 cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 
541 cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 
601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 
661 ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 
721 tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag 
781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 
841 ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 
901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 
961 atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 
1021 agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 
1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 
1141 tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca 
1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc 
1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 
1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 
1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 
1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 
1501 aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta 
1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcacttct cctgttgacg atattgacgc 
1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 
1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 
1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 
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> 1 801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 

> 1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 

> 1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 

> 1981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 

5 > 2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 

> 2101 agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 

> 2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 

> 2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 

> 2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 
10 > 2341 aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 

> 2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 

> 2461 agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 

> 2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 

> 2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 
15 > 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 

> 2701 gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 

> 2761 aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 

> 2821 cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 

> 2881 agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 
20 > 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 

> 3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 

> 3061 tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 

> 31 81 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 
25 > 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 

> 3301 tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 

> 3361 aagataaact gcacactttc cctgaacaca ctgtataaac tctttttagt tcttaaccct 

> 3421 tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta 

> 3481 ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 
30 > 3541 aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg 

> 3601 attttattat ggcaaaacta tgcttttgcc accttcctgt tgcagtatta ctttgctttt 

> 3661 atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt 

> 3721 aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat 

> 3781 aaagttttat ttatggctac eta 

35 

SEQ ID no 7 dsDNA 



> ca tgaccgtgga geagaaegtg ctgeagcaga gcgcggcgca 

> 241 gaagcaccag cagacgtttt tgaatcaact gagagaaatt aeggggatta atgacaccca 
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> 
> 
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> 

10 > 
> 
> 
> 
> 

15 > 
> 
> 
> 
> 

20 > 
> 
> 
> 
> 

25 > 
> 
> 
> 
> 

30 > 
> 
> 
> 
> 

35 > 
> 
> 
> 
> 



301 gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 
361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 
421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 
481 cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 
541 cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 
601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 
661 ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 
721 tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag 
781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 
841 ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 
901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 
961 atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 
1021 agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 
1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 
1 141 tgaaaacacl gaaatgtttg gtcagtaccc acttcagglc aatgggttca aagatctgca 
1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc 
1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 
1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 
1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 
1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 
1501 aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta 
1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcacltct cctgttgacg atattgacgc 
1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 
1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 
1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 
1801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 
1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 
1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 
1981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 
2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 
2101 agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 
2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 
2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 
2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 
2341 aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 
2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 
2461 agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 
2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 
2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 
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> 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 

> 2701 gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 

> 2761 aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 

> 2821 cagggaaaca aclatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 
5 > 2881 agattccttg clgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 

> 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 

> 3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 

> 3061 tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatataclag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 
10 > 3181 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 

> 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 

> 3301 tgaaclctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 

> 3361 aagataa 
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<210> 1 
<211> 1055 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Thr Val Glu Gin Asn Val Leu Gin Gin Ser Ala Ala Gin Lys His 
15 10 15 

Gin Gin Thr Phe Leu Asn Gin Leu Arg Glu lie Thr Gly lie Asn Asp 
20 25 30 

Thr Gin lie Leu Gin Gin Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu 
35 40 45 

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gin Gin 
50 55 60 

Glu Glu Thr Thr Tyr Tyr Gin Thr Ala Leu Pro Gly Asn Asp Arg Tyr 
65 70 75 80 

lie Ser Val Gly Ser Gin Ala Asp Thr Asn Val lie Asp Leu Thr Gly 
85 90 95 

Asp Asp Lys Asp Asp Leu Gin Arg Ala lie Ala Leu Ser Leu Ala Glu 
100 105 110 
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Ser Asn Arg Ala Phe Arg Glu Thr Gly lie Thr Asp Glu Glu Gin Ala 
115 120 125 

lie Ser Arg Val Leu Glu Ala Ser He Ala Glu Asn Lys Ala Cys Leu 
130 135 140 

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp 
145 150 155 160 

Arg Lys Arg Gin Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn 
165 170 175 

Thr Cys Trp Phe Ser Ala Val He Gin Ser Leu Phe Asn Leu Leu Glu 
180 185 190 

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro ser Asn Ala Gin Asp 
195 200 205 

Leu Pro Arg Asn Gin Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu 
210 215 220 

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val 
225 230 235 240 

Asp Pro Ser Arg Ala Val Glu He Leu Lys Asp Ala Phe Lys Ser Asn 
245 250 255 

Asp Ser Gin Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp 
260 265 270 

Trp Leu Glu Asp Ala Phe Gin Met Lys Ala Glu Glu Glu Thr Asp Glu 
275 280 285 

Glu Lys Pro Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu 
290 295 300 

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe 
305 310 315 320 

Gly Gin Tyr Pro Leu Gin Val Asn Gly, Phe Lys Asp Leu His Glu Cys 
325 330 335 

Leu Glu Ala Ala Met He Glu Gly Glu He Glu Ser Leu His Ser Glu 
340 345 350 

Asn Ser Gly Lys Ser Gly Gin Glu His Trp Phe Thr Glu Leu Pro Pro 
355 360 365 
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Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gin Ala Leu Gly 
370 375 380 

Arg Pro Glu Lys lie His Asn Lys Leu Glu Phe Pro Gin Val Leu Tyr 
385 390 395 400 

Leu Asp Arg Tyr Met His Arg Asn Arg Glu lie Thr Arg lie Lys Arg 
405 410 415 

Glu Glu lie Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gin Gin Arg 
420 425 430 

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu 
435 440 445 

Val Asp Val Leu Gin Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val 
450 455 460 

Cys Thr Ser Pro Val Asp Asp lie Asp Ala Ser Ser Pro Pro Ser Gly 
465 470 475 480 

Ser lie Pro Ser Gin Thr Leu Pro Ser Thr Thr Glu Gin Gin Gly Ala 
485 490 495 

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala lie 
500 505 510 

Ser Ser Arg Ser Val lie His Lys Pro Phe Thr Gin Ser Arg lie Pro 
515 520 525 

Pro Asp Leu Pro Met His Pro Ala Pro Arg His lie Thr Glu Glu Glu 
530 535 540 

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu lie Glu 
545 550 555 560 

Asn Asp Thr Arg Asp Leu Gin Glu Ser lie Ser Arg lie His Arg Thr 
565 570 575 

lie Glu Leu Met Tyr Ser Asp Lys Ser Met lie Gin Val Pro Tyr Arg 
580 585 590 

Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly His Tyr 
595 600 605 

Trp Ala Tyr lie Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn 
610 615 620 
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Asp He Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser 
625 630 635 640 

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr He Asn 
645 650 655 

Asp Lys Ala Gin Phe Leu He Gin Glu Glu Phe Asn Lys Glu Thr Gly 
660 665 670 

Gin Pro Leu Val Gly He Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe 
675 680 685 

Val Glu Glu Asp Asn Gin Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp 
690 695 700 

Ala Gin Leu Ala Gin Lys Ala Leu Gin Glu Lys Leu Leu Ala Ser Gin 
705 710 715 720 

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Gly 
725 730 735 

Asp Pro Glu Tyr Leu Glu Gin Pro Ser Arg Ser Asp Phe Ser Lys His 
740 745 750 

Leu Lys Glu Glu Thr He Gin He He Thr Lys Ala Ser His Glu His 
755 760 765 

Glu Asp Lys Ser Pro Glu Thr Val Leu Gin Ser Ala He Lys Leu Glu 
770 775 780 

Tyr Ala Arg Leu Val Lys Leu Ala Gin Glu Asp Thr Pro Pro Glu Thr 
785 790 795 800 

Asp Tyr Arg Leu His His Val Val Val Tyr Phe He Gin Asn Gin Ala 
805 810 815 

Pro Lys Lys He He Glu Lys Thr Leu Leu Glu Gin Phe Gly Asp Arg 
820 825 830 

Asn Leu Ser Phe Asp Glu Arg Cys His Asn He Met Lys Val Ala Gin 
835 840 845 

Ala Lys Leu Glu Met He Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr 
850 855 860 

Glu Glu Trp His Gin Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr 
865 870 875 880 
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Leu lie lie Gly Leu Glu Asn Phe Gin Arg Glu Ser Tyr lie Asp Ser 
885 890 895 

Leu Leu Phe Leu lie Cys Ala Tyr Gin Asn Asn Lys Glu Leu Leu Ser 
900 905 910 

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu lie Ser His Tyr Arg 
915 920 925 

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gin Ala Ala Glu Leu Phe Glu 
930 935 940 

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu lie lie Met Asn Glu 
945 950 955 960 

Phe lie Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu 
965 970 975 

Lys Asp He Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr 
980 985 990 

Leu Gly Gin Glu Met Glu Pro His Leu Gin Glu Lys Leu Thr Asp Phe 
995 1000 1005 

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu He Lys Ser Phe His Glu 
1010 1015 1020 

Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe 
1025 1030 1035 1040 

Ala Arg He Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg 
1045 1050 1055 



<210> 2 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Gly Leu Lys Asn Val Gly Asn Thr Cys Trp Phe Ser Ala Val lie Gin 
15 10 15 



Ser Leu 
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<210> 3 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp Trp Leu Glu 
15 10 15 

Asp 
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<210> 4 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Tyr Arg Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly 
15 10 15 

His Tyr Trp Ala Tyr 
20 



<210> 5 
<211> 1087 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Thr Val Glu Gin Asn Val Leu Gin Gin Ser Ala Ala Gin Lys His 
15 10 15 

Gin Gin Thr Phe Leu Asn Gin Leu Arg Glu lie Thr Gly lie Asn Asp 
20 25 30 

Thr Gin lie Leu Gin Gin Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu 
35 40 45 

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gin Gin 
50 55 60 

Glu Glu Thr Thr Tyr Tyr Gin Thr Ala Leu Pro Gly Asn Asp Arg Tyr 
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65 70 75 80 

lie Ser Val Gly Ser Gin Ala Asp Thr Asn Val lie Asp Leu Thr Gly 
85 90 95 

Asp Asp Lys Asp Asp Leu Gin Arg Ala lie Ala Leu Ser Leu Ala Glu 
100 105 110 

Ser Asn Arg Ala Phe Arg Glu Thr Gly lie Thr Asp Glu Glu Gin Ala 
115 120 125 

lie Ser Arg Val Leu Glu Ala Ser lie Ala Glu Asn Lys Ala Cys Leu 
130 135 140 

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp 
145 150 155 160 

Arg Lys Arg Gin Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn 
165 170 175 

Thr Cys Trp Phe Ser Ala Val lie Gin Ser Leu Phe Asn Leu Leu Glu 
180 185 190 

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro Ser Asn Ala Gin Asp 
195 200 205 

Leu Pro Arg Asn Gin Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu 
210 215 220 

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val 
225 230 235 240 

Asp Pro Ser Arg Ala Val Glu lie Leu Lys Asp Ala Phe Lys Ser Asn 
245 250 255 

Asp Ser Gin Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp 
260 265 270 

Trp Leu Glu Asp Ala Phe Gin Met Lys Ala Glu Glu Glu Thr Asp Glu 
275 280 285 

Glu Lys Pro Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu 
290 295 300 

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe 
305 310 315 320 

Gly Gin Tyr Pro Leu Gin Val Asn Gly Phe Lys Asp Leu His Glu Cys 
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325 330 335 

Leu Glu Ala Ala Met lie Glu Gly Glu lie Glu Ser Leu His Ser Glu 
340 345 350 

Asn Ser Gly Lys Ser Gly Gin Glu His Trp Phe Thr Glu Leu Pro Pro 
355 360 365 

Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gin Ala Leu Gly 
370 375 380 

Arg Pro Glu Lys lie His Asn Lys Leu Glu Phe Pro Gin Val Leu Tyr 
385 390 395 400 

Leu Asp Arg Tyr Met His Arg Asn Arg Glu lie Thr Arg He Lys Arg 
405 410 . 415 

Glu Glu He Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gin Gin Arg 
420 425 430 

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu 
435 440 445 

Val Asp Val Leu Gin Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val 
450 455 460 

Cys Thr Ser Pro Val Asp Asp lie Asp Ala Ser Ser Pro Pro Ser Gly 
465 470 475 480 

Ser He Pro Ser Gin Thr Leu Pro Ser Thr Thr Glu Gin Gin Gly Ala 
485 490 495 

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala He 
500 505 510 

Ser Ser Arg Ser Val He His Lys Pro Phe Thr Gin Ser Arg He Pro 
515 520 525 

Pro Asp Leu Pro Met His Pro Ala Pro Arg His He Thr Glu Glu Lys 
530 535 540 

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu He Glu 
545 550 555 560 

Asn Asp Thr Arg Asp Leu Gin Glu Ser He Ser Arg He His Arg Thr 
565 570 575 

He Glu Leu Met Tyr Ser Asp Lys Ser Met He Gin Val Pro Tyr Arg 
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580 585 590 

Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly His Tyr 
595 600 605 

Trp Ala Tyr lie Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn 
610 615 620 

Asp lie Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser 
625 630 635 640 

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr lie Asn 
645 650 655 

Asp Lys Ala Gin Phe Leu lie Gin Glu Glu Phe Asn Lys Glu Thr Gly 
660 665 670 

Gin Pro Leu Val Gly lie Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe 
675 680 685 

Val Glu Glu Asp Asn Gin Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp 
690 695 700 

Ala Gin Leu Ala Gin Lys Ala Leu Gin Glu Lys Leu Leu Ala Ser Gin 
705 710 715 720 

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Gly 
725 730 735 

Asp Pro Glu Tyr Leu Glu Gin Pro Ser Arg Ser Asp Phe Ser Lys His 
740 745 750 

Leu Lys Glu Glu Thr lie Gin lie lie Thr Lys Ala Ser His Glu His 
755 760 765 

Glu Asp Lys Ser Pro Glu Thr Val Leu Gin Ser lie Met Met Thr Pro 
770 775 780 

Asn Met Gin Gly lie lie Met Ala lie Gly Lys Ser Arg Ser Val Tyr 
785 790 795 800 

Asp Arg Cys Gly Pro Glu Ala Gly Phe Phe Lys Ala He Lys Leu Glu 
805 810 815 

Tyr Ala Arg Leu Val Lys Leu Ala Gin Glu Asp Thr Pro Pro Glu Thr 
820 825 830 

Asp Tyr Arg Leu His His Val Val Val Tyr Phe He Gin Asn Gin Ala 
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835 840 845 

Pro Lys Lys lie lie Glu Lys Thr Leu Leu Glu Gin Phe Gly Asp Arg 
850 855 860 

Asn Leu Ser Phe Asp Glu Arg Cys His Asn lie Met Lys Val Ala Gin 
865 870 875 880 

Ala Lys Leu Glu Met lie Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr 
885 890 895 

Glu Glu Trp His Gin Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr 
900 905 910 

Leu lie lie Gly Leu Glu Asn Phe Gin Arg Glu Ser Tyr lie Asp Ser 
915 920 925 

Leu Leu Phe Leu lie Cys Ala Tyr Gin Asn Asn Lys Glu Leu Leu Ser 
930 935 940 

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu lie Ser His Tyr Arg 
945 950 955 960 

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gin Ala Ala Glu Leu Phe Glu 
965 970 975 

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu lie lie Met Asn Glu 
980 985 990 

Phe lie Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu 
995 1000 1005 

Lys Asp lie Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr 
1010 1015 1020 

Leu Gly Gin Glu Met Glu Pro His Leu Gin Glu Lys Leu Thr Asp Phe 
1025 1030 1035 1040 

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu He Lys Ser Phe His Glu 
1045 1050 1055 

Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe 
1060 1065 1070 

Ala Arg He Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg 
1075 1080 1085 
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<210> 6 

<211> 3803 

<212> DNA 

<213> Homo sapiens 

<400> 6 

acagtcggcg tttcgccgcc tgcccgcggt gcccgcgcac gccggccgcc atcgccttcg 60 
cgcctggctg gcgggggcgc tgtcctccca ggccgtccgc gccgctccct ggagctcggc 120 
ggagcgcggc agccagggcc ggcggaggcg cgaggagccg ggcgccaccg ccgccgccgc 180 
cgccgccgcc gcgggggcca tgaccgtgga gcagaacgtg ctgcagcaga gcgcggcgca 240 
gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca 300 
gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 360 
tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 420 
tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 480 
cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 540 
cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 600 
agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 660 
ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 720 
tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag 780 
aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 840 
ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 900 
taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 960 
atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 1020 
agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 1080 
catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 1140 
tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca 1200 
tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc 1260 
aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 1320 
gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 1380 
atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 1440 
taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 1500 
aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta 1560 
tgcattggaa tttgcctcaa gtaaacctgt ttgcacttct cctgttgacg atattgacgc 1620 
tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 1680 
gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 1740 
gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 1800 
tccggcacca aggcacataa cggaggaaga actt.tctgt:g ctggaaagtt gtttacatcg 1860 
ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 1920 
tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 1980 
tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 2040 
tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 2100 
agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 2160 
cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 2220 
ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 2280 
acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 2340 
aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 2400 
agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 2460 
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agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 252 0 
aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 2580 
agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 2640 
ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 2700 
gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 2760 
aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 2820 
cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 2880 
agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 2940 
cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 3000 
attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 3060 
tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 3120 
ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 3180 
tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 3240 
ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 3300 
tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 3360 
aagataaact gcacactttc cctgaacaca ctgtataaac tctttttagt tcttaaccct 3420 
tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta 3480 
ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 3540 
aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg 3600 
attttattat ggcaaaacta tgcttttgcc accttcctgt tgcagtatta ctttgctttt 3660 
atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt 3720 
aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat 3780 
aaagttttat ttatggctac eta 3803 



<210> 7 

<211> 3169 

<212> DNA 

<213> Homo sapiens 



<400> 7 

catgacegtg gagcagaacg tgetgeagea gagegeggeg cagaagcacc ageagaegtt 60 
tttgaatcaa ctgagagaaa ttaeggggat taatgacacc cagatactac ageaagcett 120 
gaaggatagt aatggaaact tggaattagc agtggctttc ettactgega agaatgetaa 180 
gacccctcag caggaggaga caacttacta ccaaacagca cttcctggca atgatagata 240 
catcagtgtg ggaagccaag cagatacaaa tgtgattgat ctcactggag atgataaaga 300 
tgatcttcag agagcaattg ccttgagttt ggecgaatea aacagggcat tcagggagac 360 
tggaataact gatgaggaac aagecattag cagagttctt gaagecagea tagcagagaa 420 
taaagcatgt ttgaagagga cacctacaga agtttggagg gattctcgaa acccttatga 480 
tagaaaaaga caggacaaag ctcccgttgg gctaaagaat gttggcaata cttgttggtt 540 
tagtgctgtt attcagtcat tatttaatct tttggaattt agaagattag ttctgaatta 600 
caagcctcca teaaatgetc aagatttacc ccgaaaccaa aaggaacatc ggaatttgee 660 
ttttatgcgt gagctgaggt atetatttge acttcttgtt ggtaccaaaa ggaagtatgt 720 
tgatccatca agagcagttg aaattcttaa ggatgettte aaatcaaatg actcacagca 780 
gcaagatgtg agtgagttta cacacaaatt attagattgg ttagaagatg ccttccaaat 840 
gaaagctgaa gaggagaegg atgaagagaa gecaaagaac cccatggtag agttgttcta 900 
tggcagattc ctggctgtgg gagtacttga aggtaaaaaa tttgaaaaca ctgaaatgtt 960 
tggtcagtac ccacttcagg tcaatgggtt caaagatctg catgagtgee tagaagctgc 1020 
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aatgattgaa ggagaaattg agtctttaca ttcagagaat tcaggaaaat caggccaaga 1080 
gcattggttt actgaattac cacctgtgtt aacatttgaa ttgtcaagat ttgaatttaa 1140 
tcaggcattg ggaagaccag aaaaaattca caacaaatta gaatttcccc aagttttata 1200 
tttggacaga tacatgcaca gaaacagaga aataacaaga attaagaggg aagagatcaa 1260 
gagactgaaa gattacctca cggtattaca acaaaggcta gaaagatatt taagctatgg 1320 
ttccggtccc aaacgattcc ccttggtaga tgttcttcag tatgcattgg aatttgcctc 1380 
aagtaaacct gtttgcactt ctcctgttga cgatattgac gctagttccc cacctagtgg 1440 
ttccatacca tcacagacat taccaagcac aacagaacaa cagggagccc tatcttcaga 1500 
actgccaagc acatcacctt catcagttgc tgccatttca tcgagatcag taatacacaa 1560 
accatttact cagtcccgga tacctccaga tttgcccatg catccggcac caaggcacat 1620 
aacggaggaa gaactttctg tgctggaaag ttgtttacat cgctggagga cagaaataga 1680 
aaatgacacc agagatttgc aggaaagcat atccagaatc catcgaacaa ttgaattaat 1740 
gtactctgac aaatctatga tacaagttcc ttatcgatta catgccgttt tagttcacga 1800 
aggccaagct aatgctgggc actactgggc atatattttt gatcatcgtg aaagcagatg 1860 
gatgaagtac aatgatattg ctgtgacaaa atcatcatgg gaagagctag tgagggactc 1920 
ttttggtggt tatagaaatg ccagtgcata ctgtttaatg tacataaatg ataaggcaca 1980 
gttcctaata caagaggagt ttaataaaga aactgggcag ccccttgttg gtatagaaac 2040 
attaccaccg gatttgagag attttgttga ggaagacaac caacgatttg aaaaagaact 2100 
agaagaatgg gatgcacaac ttgcccagaa agctttgcag gaaaagcttt tagcgtctca 2160 
gaaattgaga gagtcagaga cttctgtgac aacagcacaa gcagcaggag acccagaata 2220 
tctagagcag ccatcaagaa gtgatttctc aaagcacttg aaagaagaaa ctattcaaat 2280 
aattaccaag gcatcacatg agcatgaaga taaaagtcct gaaacagttt tgcagtcggc 2340 
aattaagttg gaatatgcaa ggttggttaa gttggcccaa gaagacaccc caccagaaac 2400 
cgattatcgt ttacatcatg tagtggtcta ctttatccag aaccaggcac caaagaaaat 2460 
tattgagaaa acattactag aacaatttgg agatagaaat ttgagttttg atgaaaggtg 2520 
tcacaacata atgaaagttg ctcaagccaa actggaaatg ataaaacctg aagaagtaaa 2580 
cttggaggaa tatgaggagt ggcatcagga ttataggaaa ttcagggaaa caactatgta 2640 
tctcataatt gggctagaaa attttcaaag agaaagttat atagattcct tgctgttcct 2700 
catctgtgct tatcagaata acaaagaact cttgtctaaa ggcttataca gaggacatga 2760 
tgaagaattg atatcacatt atagaagaga atgtttgcta aaattaaatg agcaagccgc 2820 
agaactcttc gaatctggag aggatcgaga agtaaacaat ggtttgatta tcatgaatga 2880 
gtttattgtc ccatttttgc cattattact ggtggatgaa atggaagaaa aggatatact 2940 
agctgtagaa gatatgagaa atcgatggtg ttcctacctt ggtcaagaaa tggaaccaca 3000 
cctccaagaa aagctgacag attttttgcc aaaactgctt gattgttcta tggagattaa 3060 
aagtttccat gagccaccga agttaccttc atattccacg catgaactct gtgagcgatt 3120 
tgcccgaatc atgttgtccc tcagtcgaac tcctgctgat ggaagataa 3169 
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SEQ ID 1 protein: 

10 MTVEQ N VLQQS AAQ.KHQQTF LN Q L RE ITG1 N DTQ I LQQALKDSNGN L E LAV 
AF LTAKNAKTPQ Q EETTYYQT AL P G N D RY1 S VG SQ ADTN VIDLTG D DKDDL 
Q RAI ALSLAE SNR AFR ETG ITD EEQAI SRVLEAS I AEN KACL KRTPTE VWRD 
SRNP YD RKRQDKAPVGLKN VGNTC WFS AVIQ SL.FN LLE F R RL VLN YKP PS 
N AQDLPR MQKEHRN L P FM RELRYLFAL LVGTKR KYVDPSR AVEI LKD AFK 

15 SNDS QQQ DVS EFTH KLLD WLEDAFQ MKAEE ETDE EKP KNPM VELFYGRF 
LAVG VLEG KKF ENTEMF G QYPLQ VNGF KDLHEC LE AAM IE GE I ES LH SEN 
SGKS GQE HWFTEL P PVLTFELS RFEFNQAL GRP EKIHN KLEFPQVL YLD R 
YMHRNRE1TRI KREE I KRLKD YLTVLQQR L E RYLSYGS G P KRF P LVD VLQ YA 
LEFASSKPVCTSPVDD1DASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPS 

20 S VAAI SS RS V I H KP FTQ SR1PPDLPMH PAP RHITEE E LSVLESCLH R WRTE IE 
N DTRDLQES IS Rl H RT I ELMYSDKSM [QVP YRLH AVL VHEG QAN AGH YWAY 
I F D H RES RWM KYND I AVTKS S WEELVRDS FGGYRN AS A YC LM Yl N D KAQ F L 
I Q EEFNKETGQ P LVGI ETLP PDLRDF VEE D N Q R FEKELE E WDAQL AQKALQ 
E KLLAS Q KLRES ETSVTTAQAAG D P E YLE QP S RS D F SKH L KEETIQ I ITKAS H 

25 EH EDKS PETVLQ S Al KLE Y ARLVKL AQ E DTP PETDYRL H H VWYF I Q N Q AP K 
K1IEKTLLEQFGDRNLSFDERCKWIMKVAQAKLEMIKPEEVNLEEYEEWHQD 
YRKFRETTMYL1 1 G LE NFQ RESYIDS L L FL I C AYQN N KEL LSKGLYRGHDEE L I 
SHYRREC LLKLN EQAAE LF E SGEDRE VN N G L I IM N E F [VP F L P LLLVDEM EEK 
D I LAVE DM Rlsl R WC S YLGQ EM EPHLQ EKLTDFLPKLLD C S ME I KSFHE P P KL 

30 PSYSTHELCERFARIMLSLSRTPADGR 




• 
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SEC ID no 2: {polypeptide) 
GLKNVGNTCWFSAVIQSL 

SEQ ID no 3: (polypeptide) 
5 Q QDVSE FTH KLLD WLED 

SEQ ID no 4: (polypeptide) 
YRLH AVLVH £ GQAN AG HYWAY 

10 SEQ ID no 5: (polypeptide) 

MTVES QNVLQQ S AAQKHQ QTFL NQLRE ITG INDTQ I LQQ ALKDSNG N LELA 
VAFLT AKN AKTPOQEETTYYQTALPGND R YISVG SQ ADTN VIDLTG D DKD 
□ LQ RAIALS LAESNRAFRETG1TDE EQAIS RVLEAS IAEM KACLKRTPTEVW 
RDS R N P YDR KRQ 0 KAPVGLKN VGNTCWF SAVI Q SLFNLLE FRRL VL NYKP 

] 5 PS N AQ D LPR NQKE H RN L P F MRELRYLFAL L VGT KRKYVD PS R AVEJ LKDA 
F KSN D S Q Q QD VS EFTH KLLD WLE D AFQM KAE EETDE E KPKN P MVE L FY 
G RF LAVG VLEG KKFENTEM FG Q YPLQ VNGFKDLHE C LEAAM IEGE I ES L H 
SENSG KSGQE HWFTE L P PVLTFELSRFEFNQALGRP EKI H N KLEFPQVL Y 
LDRYMH RNRE ITRIKRE E IKRLKD YLTVLQ Q RLERYLS YGSGPKRFPL VD V 

20 LQ YALEFAS S KPVCTS P VDDI D AS SPPSG S i P SQTLP STTEQ.Q GAL S SE L P 
STS P SS VAAI SS RSV I H KP FTQS R I PPDLPMH PAP RHITE E KLS VL E SCLH R 
WRTEI E N DTRD LQ ESI S RIHRTI E LM YSDKSM I Q VP YR L H AVLVHEGQ AN A 
G H YWAYIFDHRES RWMKYN D EAVTKS SWEELVRD SFG GYRNASAYC LM 
Y I N DKAQ F L [Q EE FNKETQQ PLVGI ETLP P DLRDFVEE DNQRFEKE LEE W 

25 DAQ L AQ KALQE KLLASQ KLRES ETS VTTAQ AAG D PE YLEQ PSRSDFS KH 
LKEET I Q I ITKAS H E H ED KSP ETVLQS I MMTP N MQ G I IM A] GKSRS VYDRCG 
P EAGFFKAIKL EYAR L VKLAQ E DTP P ETD YRLHHVWYF I Q NQAP KKI I EKT 
L L EQ F G D RN LSFDERCH Isl 1M KVAQAKL E Ml KPEEVN LEE YEE WHQD YR K 
FRETTM YL I IGLE N FQ RESYI DSLLF LICAYQ N N KELLSKGL YRGKDEE LIS 

30 H YRREC L LKLN EQAAE LF E S G E D RE VN N G Li I MN E FJVPFLPLLLVDEM EE 
KD IL AVE DM RNR WC S YLG Q E M EPHLQ EKLTDFLPKLL DCS MEI KS F H EP 
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5 SEQ SO no 6; 
DNA 

>BASE COUNT 1223 a 752 c B41 g 657 1 
>ORIGlM 

> 1 acafltcggcg tticgccgcc JgcGcgcggf gcccgcgcac gccggccgcc atcgccilcg 
10 > 61 eg cciggctg g egggggege igtcctccta ggccglocg c gccgtfcccl ggagctegge 

> 121 ggagogegge agocagggee ggeggaggog cgssB&flCcg QffCQ ccaoog ccgccgccgc 

> 1 a 1 csccgcc^cc gcggg «coa tgaccatgg a geagaaegtg dgcag caga gcgcgg&gws 
- 241 gaagcaccag cagacgltH igaatcaact gagagaaalt seggggatta algawsccca 

> aq i gatactacag caagocttga aseetagiaa tggaaacttg gaattagcag 1 ggefttert 

15 > aei (actgegaag aatgetaaga occctcagca ggaggagaca actiactfscc aaacagcact 

> 42 1 toctggcaat gatagataca tcagtgtgg g aagecaagea gatacaaat-g tgaUgatat 

> 4«1 cecfragagal gertaasgertg atcttcagafl agcaattgccttgagtttgg ccgaatcaaa 

> 541 cagggcatlc agggagactg gaataactga Igsygaacaa gecaiiagea gagUcttga 

> 601 agecsgcata gcagagaafia aagcatgttt gaagaggaca crtaGagaag tttggaggga 
20 > 661 ttctcgaaac ccttalgata gaaaaagaca ggacaaagc* ocogllgggG taaagaatgt 

> 721 tgg caaiad tgttggttta gtgctgUal Icag tcatf a fftaatcttt tggaattlag 

> 781 aagattagtt ctgaallaca agcotccatc aaalgclcaa gatttaocGC gaaaccaaaa 
=> 541 sgaacategg aatltgcctt ttatgcgtga gcigaggiaf ctattlgcac ttcttgttgg 

> 901 tacraaaagg aagiaf gttg atecatcaag agcagttgaa attcftaagg atgctttcaa 
25 > 961 atcaaalgac tcacagcagc- aagalglgsg tgagtttaca cacaaattat tagatfggtt 

> 1021 agaagafgee ttacaaalfla aagctgaaga ggagacggal gaagagaagc caaagaaccc 

> 10S1 catggfagag Itgffctalg gcagaltecl ggclgtggga gtaettgaag gtaaaaaatt 

> 1141 tgaaaacact gaaatgtttg gicagtaocc acttcaggtc aaigggttca aagaictgca 

> 1201 tgagtgccta gaagclflcaa tgattgaagg agaaattgag tcmacalt cagagaaUc 
30 > 1 261 aggaaaat ea ^gecaegsgc attggmae tgaaiLacca cctfllgttae catttgaatl 

> 1321 gicaagattt gaatttaatc aogcaitggg asgaccagaa oaaallcaca acaaattaga 

> autccccaa gttttatatt tggacagata catgeacaga aacagagaaa laacaagaat 

> 1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 

> 1501 aagataUta agtf atggtl ccggtcccaa acgattccoc ttggtagatg ttcttcagts 
35 > 15*1 tgcattggaa tugoefcaa glaaacctgt ttgcacltat cctgffgacg atattgaege 

> 1621 tegifcccca cdagtggtl ccatancatc acagacatta ccaagcacaa cagaecaaca 

> 16B1 gggagecrta tcttcagaac tgccaagcac atcacdica tcagttgctg ccatttcatc 

> 1741 gagaicagta atacacaeac cautaetca gtcceggata cctccagau Igoccatgca 
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> 1 B0t fcoflgcacra aggcacataa cggaggaaga acttttfgtg ufggaaagtt gftiacaiog 

> ^6Qi aggaggaca gaaafagaaa atgacaccao agatttgcag gsaagcatat ccagaatcca 

> 1S21 tcgaacaatt gaattaatgt adctgacaa atctotgala caagttcctt atogattaca 

> 1S81 tgccgtttta gtlcacgaag gccsagctaa tgctgggcsc Eacfgggcat atatttftga 

5 > 2041 icatcgtgaa agcagaigga tgaagtacse tgatattgct gtgacaaaat cafcaiggga 

- 2 1 01 egag; ctajtg agg$ ectctt ttggtggtta tagasatgcc agtgca tact grrtaaigia 

> 2f &1 cataaatgat aaggcacaQt tcdaataca agaggagttt aataaagaaa ctgggcagcc 

> 2221 ccttgttwt atagaaacat taocaccgga IMgagagat tttgttgagg asgscaacca 

> 22B1 acgatttgaa aaagaactag aagaatggga tgcacaactt geecfigaaag ctttgcagga 
1 0 > 2341 aeagctttfa gogtctc&ga aatfgagaga glcagagact tetQtgacaa cagcacaagc 

> 24D1 agcaggagac. ccagaatatc tagagcagcc atcaagaagt gsMCtcaa agcacttgaa 

> 2461 agaagaaact attcaaataa ttaccaaggc atcacatgag eeigaagata aa&gtctfgft 

> 2521 aacagttttg cagtcggcaa tt&agttgga alatgcaagg ctSQttaa^l iggcccaaga 

> 256 t agacacccca ccagaaaccg affatcgttt acatealgia gtQglctect ttsrtccagaa 
15 > 2641 ccaggcacca aagaaaatta ttgaga aaac atta ctagaa CSatttgg eg atagaaattt 

> 270 f gagmigart gaeaggtglc acaacetaat flaaagttgct caagccaaac tggaaatgaf 
31 2731 aaaacclgsa pafigtagsct tggaggaata tgaggaglgg catcaggatt aiaggaaatt 

> 2621 ea-gggsaaca actaigiatc icaiaattgg gctagaaaat tttcaaagag aaagiiaiat 

> 29^1 egattccttg ctgttcctca toatgctta te&gaal&ac aaagaactcf igtctaaagg 
20 > 2641 cttataeaga ggaeatgatg aegaaltgat Stcacattat agaagagaat gtttgctaaa 

> 30D1 aiiaaatgag csagccocag aaclcttcga alctggagag gatcgagaag taaacaatgg 

> 3DB1 utaatt&tc atljaatgagt ttattgtooc atttttgcca ttattactgg f gg a tgaaat 

> 3121 ggaagaa&ag gatetactag agtaeaaga tatgagaaat cgatggtga cctacdigg 

> 31 81 icaagaaalg gaaccacacc locaagaaaa gctgacagaf tmtgccaa aactgeiiga 
25 > 3241 Kgttclafg gag&tt&aaa gtttccalga gccaccgaag Uaccttcat ettccacgca 

> 3301 (gaactcfgf gagcgatfEg ccogaatcat gttglccctc agtcgaactc- Ctgctgatgg 

> 336 1 aagataaacf gcacactttc cctgaacaca ctgfataaac ictttttsgl ietteaocct 
5- 342t tgcdlcctg teaeagggtt IgtfSgttQC tgctatagll Hiaacfftt ttttatttta 

> 3481 ataactgcaa aagacaaaat gaclaf acag actttagtca gactgcagac aataaagctg 
30 > 3541 aaa&tcgcai ggcgctcaga cattttaaoc ggaactgatg tataaicaca ^ictaallg 

> 3601 attttaitei ggcaaaacta IgcUUgcc acctfcdgf tgcagtsua O tttgctttt 

> 3661 atcttftcll tctcaacagc tttccattca gtctggatcc ttcoatgact acagccattt 

> 3721 aagf gttcag cactgtgtac galacataat singgtegc ttgtaaatga aataaagaat 

> 37S1 aaagrttttat ttatggciac eta 

35 

SEQ JDn&7dSONA 

> cfi tgaccgtgga geagaaegtg ctgcagcagft g^geggogea 

> 241 gaagcacrag cagaegtitt igaatcaact gagagaaatt aeggggatta atgacaooca 
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> 6fl1 gatactacag caagccttga e&gatajtaa tggaaactf g gaattagcag tggctttoct 

> 291 taclgogaag aatgcfsags ©cccfccagra ggaggagscs scJEacLacc aaacagcaGf 

> 421 tcetggcaal gatagataca tcagtgtggg aagcDaagca ga(acaaatg tgaiigattf 

> 4B1 csctggagat gataaagatg atcttcagag agcaattgcc Ugagttfgg ccgaalcaaa 
5 > $41 cegggcattc agggagactg gaataaclga Sgaggaacaa gccattagca gagltcttga 

> 501 agccagcala gcagagaata aagcatglli gaagaggaca octacagaag tttggaggga 

> 961 Hctcgaaac ecttatgala gaaaaagaca ggacaaagct cccgHgggc taaagaatgl 

> 721 iggcaatact tgttfiigtua gtg ctgual icagtcaiia Lttaartcut iggaatttag 

> 791 aagattaglt ctgaattaca agcctccatc aaatgctcaa gaiitaoecc gaaaocaaaa 
10 > 841 ggaaratcgg. aatttgccU ttatgcgtfla gclgaggrtat etatttffcac ttcJlgttfffl 

> 901 laocaaaagg aagtatgttg atccatcaag agcagugaa attcttaagg atgctttcaa 

> 961 aicaaaJgatrtcscagcagc aapatgtgas t^agiuaca cacaaattat tagattggtt . 

> 1021 agaagatgcctteGaaalga aagtfgafiga ggagacggat gaagagaagc caaagaaccc 

> 1061 catggtagag itgtfcfsfg gcagsttccl ggcigiggga gJacttgaag gfaaaaaatt 
15 > 11 41 igaaaacact gaaatgtttg gtcagtaccc acttcagglc aafcgggtlca aagalctgca 

> 1201 tgagtgcctB gaagdgcaa tgaUgaagg agaaattgag tctttacatt cagagaattc 

> 1261 aggaaaelce ggccaagagc attgguiac tgaattacca cctgigttaa catttgaatt 

> 1321 glcaagattt gasmaatc aggcattggg aagaccagaa aaaalicaca acaaattaga 

> 1361 alttcoccaa gtNtatsU iggacagata catgcacaga aacagagaaa taacaagaaf 
20 > 1441 taagagogaa gsgatcaaga gactgaaaga uacctcacg gtat'tacaac aaaggctaga 

> 1S01 aagatattta egeLatgfltt ecggicccaa aogaiicccc tiggfagatg ttcttcagta 

> 1561 tgcattggaa tttecctcaa gtaaaccigt ttgcacttd cctgttgacg atattgacgc 

> 1321 tagttoocca crtaglggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 

> 1981 gggagcccta tcttcagaac tgccaagcac atcacciica tcagttgctg ccattteatc 
25 > 1741 gagalcagla etecacsaac cstttactca gtcccggata cctccagatt tgoecatgee 

> 1501 tcoggcacca aggcatatefl cggaggaaga actttctgtg ctggaaagu gtttacatcg 

> ieai dggaggaca gaastagaaa sigacaccag agaiugcag gaaagcaiat ccagaeicca 

> 1 82 1 tcgaacaatt fl aattaalgt actcigscaa atctatgat a eaagttccu atcgattaca 

> 1«1 tgrogtttta gtlcacs&ag gccaa^ctea tgtfgpgcac tacteggeat atatttttga 
3D > 2041 fcategtgsa agcHgatgga tgaagtacaa igatattgct glgacaaaat catcatggga 

> 2101 agagctagtg aggg&rtcll Uggtggtta tagaaatgcc agigeatact gtttaatgtH 

> 2161 cataaalgart aaggcacagC tcctaataca egaggagtft aalaaagaBB clgggcagoc 

> 2221 cctiguggt atagaaacat tacraccgga tltgagagef fttgttgagg aagacaacca 

> 2261 acgatttgaa aaagaadag aagaalggga Igcacaactf gcccagaaag ctttgcagga 
35 > 2341 aaagctltta gcgtctcaga aattgagaga gtcagagact fclgtgacaa cagoacaagD 

> 2401 agcaggagac ccagaatatc tagagcagcc atcaagaagi gatttctcaa agcacttgaa 

> 2461 agaagaaact altcaaataa ttaccaaggc atcacatgag caitgaageia aaagtoclga 

> 2521 aacagttttg cagtc^gcaa ttaagiigga atatgcaagg ttggttaagt tggoccaaga 

> 25Q1 agacaccoca ocagaaaoog attatcgm acetcaigta giggtctact ttatooagaa 
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> 254 1 ccaggcacca asgaaaatta ttgagaaaac atfactagaa caatttggag atagaaattt 

> 2701 gagttttgat gaaaggtgtc acaacaiaat gaaagttgci caagccaaaD tggaaatgat 

> 2761 aaaacctgaa gaagtaaact fggaggaaia tgaggagtgg catcaggart ataggaaatt 

> 2621 cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 
5 > 2BB1 agattodlg dgttcctca tctgtgclla Lcagaataac asagsactct tgttfaaagg 

> 2941 cttetacaga ggacatgatg aagaattQiat elcacattat ega&csgaat « tttg ctaaa 

> 3001 alleaatgag csa$cc0caD aactcHOQa frtciggajag g&tcffagaag taaacttfrtjg 

> 3Q&1 tllgaftatu atg aatgagt ttattglccc altwgoca ttattactgg legatgasat 

> 3121 ggaagaaaag gatatactag ctgtagaaga talgagaaaf cgatggtgll ccfacc«gg 
tO > 3181 tcaagaaatg gaaccaucc tccasgaaaa gdgacagat tttligccaa aactgcttga 

> S241 ttgttctatg g&gaUaaaa glUccatga gccaccgeag ttaccttcet attccacgca 

> 3301 tgaartctgl gag cgattig cccgaatcat gttgtacctc agtogaactc ctgctgalgg . 

> 3361 aagataa 
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<110> School Of Pharmacy 
<12D> Treatment of Cancer 

<H0> PCT/GE DO02446 
<141> 2D00-OG-22 

<1*>\> 19S9-06-22 

<150> 0006161,2 
<151> 2000-04-01 

<1£0> 7 

<170> Pat&ntln Vcr. 2.1 

<210> 1 

<;211> 1055 

<212> PfcT 

<213> Kocno sapiens 

<4DD> L 

Met Thr Val Giu Gin Asn Val Leu Gla Gin Ser Ala Ala Gin Lya His 
IB ID 15 

Gin Gin Thr Phe lau Aan Gin Leu Arg Glu lie Thx Sly II* Aer> Aep 
ZO 25 30 

TDr Gin lie Leu Gin Gin Ala Leu Lya Aap Ser Asn Gly Aan Leu Glu 
35 40 45 

Leu Ala Val Ala Phfi Leu "Thr Ala J,y$ A*n Ala Lys TMir Pre Gin Gin 
50 55 &0 

Glu Glu ThJC Thr Tyr Tyr Gin Thi Ala Leu Pro Gly Aan Aap Arg Tyz 
65 "?0 15 B0 

lie Ser Val Gly Ser Gin Ala Asp Thr Aan Val He Asp Leu The Gly 
tfS 90 95 



Asp Asp l»ys Asp Asp Leu Gin Axg Ala He Ala Lbu &ci Lev Ala Glu 
100 105 IIP 



WO PCT/GB0D/Q1446 

Ser Asn Arg Al* Phe Arg Glu Thr Gly He Thr Asp Glu Glu Gin Ala 
115 120 125 

I la S« Arg Val Leu Glu Ala &cx lie Al* Glu Asn Lys Ala Cys low 
130 135 1-30 

ly;s Arg Thr Pxo Thr Glu Val Trp Arg A*p S*r A*g Abii Pro Tyr. Asp 

iao 1&5 160 

Arg Ly* Ar? Gin Asp lys Ala Pro VaL Gly Leu Lys Asn Val Gly A=n 
165 170 i75 

Ttor Cys Trp Phe Ser Al* Val He Gin Ser Leu Phe Aan Lew Leu Glu 
1*0 155 1^0 

Phe Arg Axg Leu. Val Leu Abii Tyr Lys Pro Pro S*r A*ft Ala Gin Asp 
195 200 205 

Leu Pro Arg As ft Gin Lys Glu His Axg Asn Leu Fro Pbe Hat Arg Glu 
210 515 220 

L«u Arg Tyx Leu £he Al* Leu Leu Val Gly Thr LyB Arg Lys Ty* Val 
22^ 230 235 240 

ABp Pro ser Ar? Ala Val Glu lie Leu Lys Asp Ala Phe Lys Sex Asn 
245 2^0 255 

Asp Sex Gin Gin Gin Aap val Sex Glu Phe Thr His Ly* Leu Leu. Aap 
260 2&5 27D 

Trp Leu Glu Aep Ala Fhe Gin wet Lys Ala Glu Glu Glu Thr Asp Glu 

275 2SQ 

Glu Lys Pro Lys Asrt Pro Met val Glu Leu Phe Tyr Gly Arg Phe Leu 
2&0 29^ 300 

Ala Val Gly Val Leu Glu Gly Lys Lya Phe 61u Ain Thr Glu Met Phe 
305 3LD 320 

Gly Gin Tyr Pro Leu Gin Val Asia Gly, Ph* Ly* Asp leu Kis Glu Cys 
325 530 335 

Leu Glu Ala Ala Met II* Glu Gly Glu He. Glu Ser Leu His S«r Glu 
540 345 350 

Aan ser Gly Lya ser <51y Gin Glu Hia Trp Phe Thx Glu Leu Pro Pra 
355 . 360 365 
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WO 00/79267 FCT/CHWWD244* 

val Leu Thr Phe Glu leu Ser Arg Phe Glu Pbe Asn Gin Ala Lev Gly 
570 375 3B0 

Are Pro Glu Lya He Bia Aim Lys Leu Glu Phe Pro Gin Val l*t±u Tyr 
395 ' 390 395 400 

Leu Aap Rrg Ty* Met His Arg Asn Arg Glu He Thr Arg lie Lys Arg 
*05 410 415 

Glu Glu Jle Lys Arg Leu Lys Aep Tyr Leu Thr Val Leu Gin Gin Arg 
<J20 425 430 

Leu Glu Arg Tyr isu Scr Tyr Gly Sex Gly Pro Ly= Argf Phe Pro Leu 
435 440 445 

Val Asp val Ley Gin Ty* Ala Leu Glu Fhc Aia Sex Lys Pro Val 

-J5Q 455 460 

Cya Thr Ser Pr-D Val Asp Asp lie Asp Ala Ser Ser Pro Frp Sex Gly 
4&5 ftfO 475 4BD 

Ser 11c Pto Ser Gin Thr Leu Pre? Ser Thr Thr Glu Gin Gin Gly Ala 
485 490 495 

Leu Ser Ser Glu Leu pro ser Thr Ser Pro Ser Ser Val Ala Ala lie 
500 

Ser Ser Ar? Ser Val lie His Lys Pr& Phe Thr Gin 3sr Arg He Pro 
515 520 525 

Fro Asp Leu Pro Met His Pro Ala Pro Arg Kis 31c Thr Glu Glu Glu 
530 535 540 

L»u Scr val Leu Glu Ser Cys Leu Hi a Arg Trp Arg Thr Glu lie Glu 
545 550 555 5€0 

Asn Asp Thr Arg Asp Leu Gin Glu Ser He Sex Axg lie Hia Arg Thr 
S65 570 5T5 

lie Glu Leu Met Tyr 3er Asp Lys Set* Met lie Gin val Pro Tyr Arg 
5&D 595 590 

Leu Ria Ala Val Leu Val Hie Glu Gly Gin Al* Asn Alft Gly His Tyr 
59S 6O0 605 

Trp Ala Tyr lie Phe Aep His Arg Glu Ser Ar<? Trp Met. Xya Tyr Asn 
610 filS WO 
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Asp Xle Ala Val Thr Lys 5m Ser Trp Glu Glu Leu V*l Ar§ Asp Ser 
£25 6*0 6S5 640 

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cy= Leu Met Tyr He Acn 
6415 650 655 

Asp Ly* AlA Gin Phe Leu lie Gin Glu Glu Phe Asn Lya Glu Thr Gly 

Gin Pr* Leu val Gly lie Glu Thr Leu Pre Pro Asp Lau Ar^ Asp Phe 
675 680 6*5 

Val Glu Glu Asp Am Gin Ar9 Phe Glu Lya Glu Leu Glu Glu Trp Asp 
59Q 695 700 

Ala Gin LeU Ala Gin Lya Ala Leu Gin Glut Leu Leu Ala. Ser Gin 

7U5 710 715 720 

Lys Leu Arg Glu Ser- Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Sly 
725 730 735 

Asp Pro Glu Tyr Leu Glu GIti Pre Ser Arg Ser Asp Fhc Se^ Lys His 
740 74^ 750 

Leu Lys «lu Glu Thr lie Gin lie lie Thx Lys Ala Ser His Glu Kl* 
755 760 755 

Glu Asp Ly* Ser Pro Glu Thr val Leu Glu Ser Ala He Lys Leu Glu 
770 775 

■Tyr Ala Arg Leu Val Lys Leu Ala Gin Glu Asp Thr Fro Pro Glu Thr 
7^5 790 795 600 

ABp Tyr Axg Ley His His val Val val Tyr Phe II* Gin Aan Gin Al* 

B10 B15 

Pro Lys Ly* He He Glu Lys Thr Leu LeU Glu Gin Phe Gly Asp Arg 
S20 825 B30 

A*n Leu Ser Phe Aap Glu Arp Cy*s His Aan He Met Lys val Ala Gin 
&35 840 845 

Ala Xys Lew Glu Met He Ly b Pro Glu Glu Val Asn L«U Glu Glu Tyi 
fiSO £55. &60 

Glu Glu Trp Bis Gin Asp Tyr Arg Lys Pha Axg &lw Thr Tnr Met Tyr 
665 B^O 875 BB0 
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WO 00/792*7 PCT7GB0O102446 

Leu lie lie Gly Leu Glu Ahii Phe Gin Arg Elu S€r Tyr lie Ser 

BBS BSD 995 

Leu Leu Phe Leu lie Cys Ala Tyx Gin A^n Asn Lys Glu X*eu Leu St* 
900 90S 910 

Ly5 Gly Leo Tyr Arg Gly Hia Asp Glu Glu Leu lie &er HI* Tyr AX9 

9.15 92D $25 

Glu cys Leu Leu Lys Leu Asn Glu Gin Ala A14 Glu Leu Phe Glu 
930 935 940 

&er Gly Glu Asp Arg Glu Val Asn Asn Gly Leu II* lie Met ABn Glu 
545 9&0 9^5 *S0 

Phe lie Val Pro Phe Leu Pra Leu Leu Leu Val Aap Glu Met Glu Glu 
955 970 

Lys A*p Xle Leu Ala Val Glu Aap Met Arg Aan Arg Trp Cye Ser Tyr 
SBO 9BS 

J,eu Gly Gin Glu Met Glu Pro His Lou Gin Gly Lys Leu Thr Aap Phe 

1000 1035 

Leu Pro Lys Leu L*u Asp Cys wtt Glu lie Lys Ser Phe His Glu 

1010 1015 

Pro Pro Lya Leu Pro Ser Tyx Ser Thr His Glu Leu Cy* Glu A*g Phe 
L025 10SO 1035 1040 

Ale Arg lie Met Leu Sex Leu S«r Arg Thr Pro JlIb Asp Gly Arg 
1045 1050 1D5>5 



<2l0> 2 
<3H> 1& 
<212> P?T 

<213> Kcwno sapiens 
<4DQ> 2 

Gly Leu Ly* Afift Val Gly Aen Tfcr Cye Trp Phe Ser Ala Val lie Gin 
1 & 10 is 



Ser L«V 
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<210> 3 
*211> 1? 
<212> PRT 

<213> Komo sapiens 
<40Q> 3 

Gin Gin AHp V&l Ser GLu Phe Thr His Lys Jjqu Leu Asp Trp Leu Glu 
15 ID 15 

Asp 



<210> 4 
<211> 21 
<212> FRT 

<213> Homo japiena 
<aoo> 4 

Tyr Arg Leu Bis Al± Val Leu Val His Glu Gly Gin Ala Asn Ala Gly 
15 10 15 

Mis Tyr Trp Al* Tyx 
ZD 



<210> 5 

<211> 108? 

<212> FRT 

<213> Hc*na sapiens 

<40O> 5 

Met Thr Val Glu Gin A*n Vil Leu Gift Gift Ser Ala Ala Gin Lyy Hi* 
15 10 15 

Gin Gin Thr £>he Leu Asn Gin Leu Arg Glu lie Thr Gly lie Asn A«p 
20 25 30 

Thr Sin lie Leu Gin Gin Alo Lsu Lys top £J«r Aan Gly Asn l,eu Glu 
35 40 45 

XeU Al* VAl Ala PhB Lau Thr Ala Lys Asn Ala Lye Thr Pro Gin feln 
50 55 60 

Glu Glu Thr Thr Tyr Tyr Gin Thx Ala Leu Pro Gly A*n A*p Arg Tyr 
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63 70 75 

lie ser Val Gly S«tr Gin Ala Asp Thr Asn Val II* Asp Leu Thr Gly 

P5 $0 

Asp Asp Lys Asp Asp Leu (Sin Arg Alia lit W.* Leu Ser Leu Ala Glu 
100 10b HO 

Str Aan Ar§ Ala Phe Arg Glu Thr Gly lit Thr Aep Glu Glu Gin Ala 
115 120 125 

He Ser Arg Val Leu Glu Ala Ser lie Ala Glu Asn Lys Al* Cy* Leu 
130 ^35 140 

Lye Arg Thx Pro Thr Glu Val Tip Arg Asp Ser Arg Asn Pro Tyr Asp 
145 150 155 160 

Aig Lys Artf Gin Asp Lys iU* PtP Val Gly L«u Lye- Aan Val Gly Aan 

iw no i">& 

Thr Cys Trp Phe ser Ala val He Gin &er Leu Phe Asn Leu Leu Glu 

iho ies> 1^0 

'Phe Arg Arg Lbu Val Leu Asn Tyx Lys Pro Pro Sbi Am Als Glh Aap 
19& 200 205 

Leu Pr* Are Asn Gin Ly b Glu His Arg A*tk Leu Pro Phe Met Arg Glu 
210 215 220 

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Aig Ly* Ty* Val 
225 230 235 240 

Asp Pro Sex Arg Ala V*L Glu lie Leu Lys Asp Ala Phe Lys Ser Aan 

245 255 

A«£> Gin Gin Gin Asp Val Ser Glu Phe Thr Hie Lys Leu Leu Asp 

2GD 255 270 

Trp L&u Glu Asp Ala Phe Gin Het Lys Ala Glu Glu Glu Thr Asp Glu 
275 2B0 2BS 

Glu Lys Pro Lys Asn Pre? Met Vel Glu Leu Phe Tyr Gly Axr9 Ph* Leu 
290 295 300 

Ala Val Gly Val Leu Glu G£y Lys Lys Phe Glu A* ft Thr Glu Wet Phe 
305 3i0 315 32P 

Gly <3ln Tyr Pro Leu Gin Val Aan Gly Phe Lya Aap Leu His Glu Cys 



WO PCT/GB0WD2*« 

325 330 3S5 

L^u Giu Ala Ala Met lie Glu Gly «lu He Glu Ser Leu His Ser Glu 
340 545 350 

Aari Sex Gly Lya Ser Gly <31n Glu Rl£ Txp Phe Thr Glu LeU Pro Pro 
355 360 

v«l Leu Thr Ph«. Giu Leu Bex Arg Fhe Glu Phe A*n Gin Ala Leu 61 y 
370 375 3*0- 

Arg Fro Glu Lys I la Bis Asn Ij^b Leu Glu Phe Pro 5ln Val Leu Tyr 
3fc5 390 3W> 40D 

Leu fcsp Arg Tyr Met Arg Aan Arg Glu 11« Thr Arg He Xys Aig 

405 41D . 415 

Giu lie Lys Arg Leu Lya Asp Tyr L^u Thr Vsi Leu Gin Gin 
420 425 430 

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lya Arg Phe Pro Leu 
43b 440 445 

Val Aap Val Leu Gin Tyr Ala Lau Glu Pbe Ala 5er Ser Lyia Pro Val 
45Q 455 460 

Cya Thr Ser Px* Val Asp Asp lie A*p Ala Str Ser Pro Pro Sex tyly 
4S5 4?a 475 460 

5?r He Pr& Ser Gin Thr Leu Pro Ser Tbx Thr Glu Gin Gin Gly Als 
485 450 495 

Leu Ser Ser Glu Leu Pro Sex Thr Sex Pxo 5er Val Ala Ala He 

500 505 510 

Sex Sex Arg 0e* Val He Kis Ly* Pro Phe Tfcr Gin Ser Arg lie Pro 
515 520 52E> 

Fro- A*p Leu Pxo Mot Kis Pro Ala Pro Arg Bis lie Thr Glu Giu Ly b 
530 535 S40 

Leu ser Val Leu Glu Ser cya Leu His Axg Trp Arg TJ*r Giu He Glu 
545 550 555 560 

A«n Aap Thx Arg Asp Leu Gin Glu Ser lie &er Arg He Hia Arg Thr 
565 570 573 

11c GlU Leu Met Tyr Ser Aap Lya Ser Met He Gin Val Pxo Tyr Arg 
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560 5B5 5£0 

Leu Kia Ala Val Leu Val HI* Glu Gly Gin Al* A*ft JUa Ely Bis Tyr 
595 600 $05 

Txp Ala Tyr lie Fhe Aep His Arg Glu Set Are Trp Met Lys Tyi Asn 
610 515 620 

Asp lie A1& Val Thr Lys Ser Ser Trp CJlu Glu Leu Val Arg Asp Ser 
625 6aa 555 640 

S>he Gly Gly Tyr Arg Asn Ala Ber Ala Tyr- CyB Leu Met Tyr lie Aan 
64S 650 655 

Aip Lyfi Ala Gin Phe Lgu lie Gin Glu Elu Phe Aan Lys Glu Thr Gly 
660 665 670 

Gin Pro Lew Val Gly lie Glu Thr Leu Pro Pitp A^p Leu Arg Asp Phe 
675 690 695 

Val Glu Glu Asp Asn Gin Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp 
690 6£5 700 

Ala Gin Lau Al* Sin Lys Al* Leu Gin Glu Lys Lau Leu Ala Seje Gin 
"?0£ 710 71S 720 

Lys Glu Ser Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Gly 

7£i 730 735 

Asp Pxo Glu Tyr Leu Glu Gin l?ro Sex Arg Ser Asp Phe Lye His 

740 745 750 

Leu Lye Glu Glu Thr lie Gin 12c lie Thr Lys Aln Ser Hifl Glu His 
7S5 760 765 

Glu A&p Lys Ser Pro Glu Thr Val Leu Gin Sex lie Met Mat Thr Pro 
77^ 775 7fi0 

Aan Met Gin Ely 31© lie Met Ala He Gly Ly b Ser Arg Sisr Val Tyr . 
735 7&D 7£5 BOO 

Asp Arg Cys Gly Fro Glu Al« Gly Phe Phe Lys Al* He Ly« Leu Glu 
605 910 $15 

Tyr Ala Arg Leu Val Lys Leu Al± Gin Slu Asp Thr Pro Pro Glu Thr 
B20 625 930 

Asp Tyr Arg Leu His His Val val val Tyr Phe lie Gin Asn Gin Ala 
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B35 B4C ^5 

Pro iys Lys lie Tie Glu Lys Thx Leu Leu Glu Gin Fhe Gly Asp Arq 
H50 355 660 

Asn Leu Sex Phe Asp Glu Arg Cys His Asn II© Met lys Val Ala Gin 
355 870 67S SBO 

Ala Lya Leu Glu Met lie Lya Pxd Glu Glu Val Asn Leu Glu Glu Tyr 
835 630 695 

Glu Glu Txp Hi* Gin Ajsp Tyr Ar§ Lys Phe Arg Glu Thr Thr Met Tyx 
900 905 910 

Leu 11c lie Ely Leu Glu Abji Phe Gin Arg Glu S&r Tyx lie Asp 3er 
515 920 325 

keu Leu Kfte Leu He Cys Ala Tyx Gin Aan Abfi Lys Glu Leu Leu Scr 
950 &3S 540 

iiys Gly Lbu Tyr Arg Gly His Asp Glu Glu Leu 31c Ser Hi* Tyr Are 
945 955 

Aro. Glu Cye Leu Leu Dys Leu Asn Glu Gin Ala Ala Glu Lbu Ph© Glu 
965 9*70 &75 

sex- Gly Glu Asp Arg Glu Val Asn Asn Gly Leu lie 11© Met Asn Glu 

990 SB5 590 

Pile lie V^l Pro Phe Leu Pro Leu; Leu Leu Val Asp Glu Met Glu Glu 
995, lOOO 1005 

Lya Asp He Leu Ala Val Glu Asp Mat Axg Asn Axg Txp Cys Ser Tyr 
1010 1&1& 1G2Q 

Leu Gly Gin Glu Mat Glu Pro Ri& Leu Gin Glu Lys L«u Thr A»p Phe 
1025 1030 1035 1040 

Leu Pro Lys Leu Leu Asp Cya Ser Met Glu lie Lya £e* Phe His Glu 
1045 1Q50 IvW 

Pro Pro l>ys Leu Pro Sex Tyr Ser Thr Bis Glu Leu Cys Glu Arg *he 

1063 1070 

Ala Arg lie Met, lieu Sex Leu Sex Arg Thr Pro Ala Asp Gly Arg 

1075 ioao ioas 
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<210> 6 
<2L2> 

<2i3> Eamo sapiens 
<400> 6 

aCa^teggcg tLt/tcgccgcc tgcccgDggt gcccgcgcac gccggDcgcc atcgccttcg €0 
cgcctggctg gcgggggcgc tgtcctecca. ggccgtcegc gcegctccct ggagctcggc i£0 
ggagcgcggc ag cc a ojgg e e •ggeggaggcg' <;$*ggagccg ggc$c-c*ceg cegcegecgc IBQ 
e^rcc-gccgcc gcgggggcca fcgaccgtgga gcagaacgtg ct,gcagcaga gcgcggcgca 240 
gaagcacca<g cagacgt/ttt tgaatcaact gagagaaatt acggggatta atgacaDcca 300 
gatactacag caiqccttga aggatagtaa t9^a,«ettg ga,Att±geag tggctttcct 360 
tactgcga^g a at gct-a a.9<a cccctcfltgca ggaggagaca act tact ace a a« m tfCAc: t 42 Q 
tcctggcaat. gatagataca, tcagtgtggg aagecaagea gatacaaatg tgattgatct 430 
Cftctggagat gataaagatg atcttcagag ageaattgee ttgagtttgg. ccgaatcaaa 5-40 
cagggcattc agggagactg gaataactga tgaggsacas gcca-ttagca gagtt-cttga 600 
agecagcata gcfl^gaaLa aageatgttt gaagaggato cctocaga.a.9 tttggagggs 6&G 
ttct^gaaac ccttatgata gaaaaagaca ggacaaagct cccgt-tgggc taaegaetcit 720 
tggcaa.ta.tt tgttggttHa gtgctgttat tcagtcatta tttaatcttt tggaatttag 760 
aagattagtt ctgiattaea agecti'^tc Aait^Cteaa. <ga,tt t^cccc gaaaccaaaa 640 
ggaa cafe egg *atttgcctt ttatgcgtga getgaggtat ttatttgeac ttcttgttgg 900 
taccaaaagg aacjtatgttg atCC±tC*ag a9Ca,<Jttgaa attcttaago; atgetttesa 960 
atcaaatgac tcacagcagc aogatgtgacj tgagtttaca cacaaattat tagattg^t t 102 0 
a^ags tgee- ttc-c-aaatga aagctgaaga ggagaeggat gaagagaagc caflagnaccc 10 BO 
catggta.^ag' ttgttctatg gcagatt&ct ggctgtggga gtacttgaag gtaaaaaatt 1140 
tgaaaacaot ga*.atgtttg gtcagtaccc ac tt ta^l pte a.otMJttca aagatrctgea 120 d 
tgagtgecta gaagctgcoa tgattgaagg agaaattgag tctttacatt ^ApaAtte 1260- 
aggaaaarca ggecaagage attggtttac tgaattacca cctgtgttaa cottt^Mtt 1320 
gtcaag^ttt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 1360 
atttDcccna gttttatatt: tggacagata catgeacaga sacagagaas taacaagaat 1440 
taagagggaa g*gatcaaga gactg&aaga t to t tac g 3ta.tta.caac aaaggctaga 15DO 
aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcfctcagta 
t^cattggaa tttgcctcaa gtaaacctgt ttgeacttet cctgttgacg atattgaege 1620 
taflttcecca actagtggtt ccataccatc acagacatta ccaagcacaa cagaacnaca 1690 
gggagcect* tcttcagaac trgccaagcac atcaccttca tcagttgctg ccstttcatc 1740 
gagatcagta atacacoooc: catttaetca. gtcceggota cctccagatt tajceeatgea 1800 
tccgqeacca sggcacataa eggaggaaga acttt-ctgt-g ctggaaagtt gtfctacatcg 10 60 
ctggaggaea gaaatagaaa atgacaccag agatttgeag gaaagcatat ccagaatcca 1920 
tcgaacaatt gaattaatgt a c c ct gac a^ atctatgata, e*a$ttectt at^gattaca J&SD 
tgccgtttta gttcacgaag gecaagrta^ tgct^flcac tcctgggcot atatttttga 2O40 
ttatcgtgaa agcagatgg« tgaagtacaa tgatattget gtgacaaaat cotcatg^ga 2100 
ag&gct&gtg agggactctt ttggtggtta tagaaatgee agtgcatact gtttaatgta 2160 
cataaat^&t Ba^gcacegt tcctaataca agaggagttt aa taaagaaa ctgggcagDC 2220 
ccttgttggt atagaaacat toccaccg?© tttgagagat tttgttgag? a* fla, caac Ca, 22 50 
ac^atttgaa aaagaactag aagaatggga tgeacaaett geccagaaag etttgeagga 2340 
aaegctttta gegtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 2400 
agcaggagac tcagaatatc tagagcagee atcaagaagt gatttctcaa agcacttgaa 2460 

11 




WO 0WTO67 



agaasaaact attcaaatsa ttaccaaggc atcacstgag catgaagata oa*,?tcetga 2520 
aaDagttttg Miflftcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 2£b0 
t^cacccc3 ccagaaaccg attafccgttt. acatcatgta gtggtctaet tit atccagaa 2640 
cc&ggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 2700 
gag^tttgat 9* *a ggt: gtc acaacataat gaaagttflcrt caagccaaac tggaaatgrat 2760 
aaaacctgaa gaagtaaact tgg^aate tgaggagtgg catcaflgatt ataggaaatt 2S20 
cagg2*** ca actatgtatc tcataattgg getagaaaat ttrtcaaagag aa«e"atat 23 BO 
ag attest* 9 < t<gttc etc a tctgtgctta tcagaataa* aaaga&Gtct tgtctaaagg 
cttatacaga ggacatfcfite wgaattgat atcacattat ag**ga_gaat gtttgctaaa 3000 
attaaatgag caagccgcn^ aaetettcga atctggagag gatcg*gaag taaaDaatgg 5050 
ttt^ettatc atgaatgagt ttattgtccc atttttgeca ttattactgg tggatgaaafc 3120 
ggaagaaaag gatatactag ctgtagaaga tatgagaaat egatggtgtt. -cctaccttgg 31S0 
tcaagaaatg gaaccacae? teceageaaa gctgacagat tttttgecaa aactgettga 3240 
ttgttctatg gagattaaaa gtttccatga geeaccgaag ttac^ttcat attccacgca 330O 
tgaactctgt ^agtgatttg cccgaatcat gtt^ttCCtc agtcgaactc ctg^t^atgg 3360 
aagataacct gcacactttc CDtgaacaca ctgtatiaac tctttttagt ■ tctta&ccct 342 0 
tgccttcctg tcaeagggtt t get tgt tgc tgctatagtt tttaactttt ttttat-ttta 3 450 
etaactgeas aagacnaaat gactatacag actttagtca gacttftagae aaeaaagcfcg 3540 
5 aiatcgcat ggegctcaga catttta«ee ggaactgatg tataatcaC* aatctaattg 3600 
attttattat ggcaaaacta tgettttgee a ee ttc ctg t tgcagtatt* etttgetttt 3-660 
atcttttctt t-Ctcaacagc tttccattca gtctggatcc tteeatgact acagecattt 5720 
aagtgttcag ^aetgtgtsc: gatacataat atttggtGJC ttgtaaatga aataa«eraat 37S0 
saagttttat ttatggCtac eta 3B03 



<2L0> 7 
<211> 3169 
<£i2> DMA 

<±13> Homo sapiens 
<400> 1 

catgacegtg gagcaga.**S tgetgeagea gagegeggeg ca^aagcacc agcagactftt 60 
tttgaateaa ctgagagaaa ttaogggfcAt taatgacacc cagatactax ageaagcett 120 
gaaggatagt aatggaaact tggaattagc a$tggetttc ettactgega a^aatgetaa 180 
gac^CCteag caggaggaga caacttaefca waaacagca cttcctggca atgatagata 240 
catcagtfct? ^gaagceaag cagatacaaa tgtg«tt^at ctcactggag atgatftaaga 300 
tgatcttcag agagwattg eettgagttt ggecgaate* apeagggcat tcagggaflac 360 
tggaataact gatgaggaac aagecattag cagagttctt gaageca^ea tagcagagaa 420 
taaagcatgt ttgaagagga cacctacaga agtttggagg gattctcgaa accet.tat.ga 4 BO 
tagaaaaaga eaflgaeaaag <:tcccgt:i:gg gctaa<jgaat gttggcaata cttgtt<3gtt 540 
tagtgctgtt attcafltcat tatttaatct tttgga4ttt agaagattag ttctg«atta 600 
caagcctDca teaaatgetc aaga.tttacc ccgaaaccaa aaggaacatc o/gaatttgee 
ttttatgegt gagctigaggt atetatttge acttcttgtt ggtaccaaaa ggaagtatgt 72 0 
tgatccatc* ai^agcagttg aaattcttaa ggat?ett:tc aaatDaaatg actc^cagca 7S0 
gcaagatgtg agtgagttt* caeacaaatt attagattgg ttd^aagatg ccttcc«<iat a40 
gaaagctgaa gaggagaegg ©tgaa^agaa gecaaagaac cccatgfltag agttgtt-ct* 90U 
tggcagattc otggctgtgg ga@taet.tiga aggtaaaaaa fcttgaaaace ctgaaatgtt 9^0 
tggtcagtac ccacttcagg tcaatpggtt eaaagatctg catgagtgc^ t *ga agetge 102« 
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